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Design Considerations and Proposals for 


Compatible U. 5. Subsidiary Coinage 


By R. A. KIMBER and R. R. STOKES 


(Manuscript received August 16, 1965) 


For 173 years since the Coinage Act of 1792, United States subsidiary 
coinage has contained 90 per cent silver. During the last decade, an 
increasing unbalance between supply and demand has jeopardized the avail- 
ability of silver and made it necessary to consider alternative coinage mate- 
rials. Because present coin-handling mechanisms are designed to discrimt- 
nate between coinage and spurious materials, it 1s necessary to identify the 
functional properties of coins so that a compatible material system may be 
proposed. Compatibility is established by considering such items as weight, 
resistivity, diameter, thickness, wear, scrap, composition, coinability, cor- 
rosion and color. One alloy and four laminated metal systems, varying in 
silver content from 70 to O per cent, were found which satisfy the compati- 
bility requirements. This paper discusses the design considerations for 
compatibility and proposes five metal systems which meet these considera- 
tions. 


I. INTRODUCTION 


Coin silver, as used in U. S. subsidiary coinage, is made of an alloy 
containing 90 per cent silver and 10 per cent copper. During the last 
decade, it has become increasingly evident that the availability of this 
alloy for the continued production of coinage is jeopardized by an in- 
creasing unbalance between the demand for, and the supply of silver. 

The involvement of Bell Telephone Laboratories in this issue results 
from the use of 5, 10, and 25. cent coins in public telephones. In view 
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of the possible change in subsidiary coinage alloy, it became necessary 
to identify the functional properties of coins in their various applications 
and to specify accurately those properties which are essential to main- 
taining an operationally compatible coinage system. To this end, work 
was initiated on a study of coin properties and compatible metal systems 
for U.S. subsidiary coinage. 

This paper records (2) the results of an investigation of the permissible 
deviations from the present coinage system for compatible performance 
in coin chutes, (27) proposals for changes which represent improvements 
in the present coinage, and (777) the analysis and description of various 
metal systems which could be employed. This information has been 
discussed and confirmed with various representatives of the vending 
industry and conveyed to the U. 8. Mint and the Battelle Memorial 
Research Institute, consultant to the Mint, for their use. 


II. COIN CONSIDERATIONS 


2.1 Functional Requirements 


The selection of alternative metal systems to provide compatible U. 8. 
subsidiary coins involves a number of diverse considerations. There are 
practical questions relative to manufacture and procurement, psycho- 
logical questions of public acceptance, and functional questions of serv- 
iceability in coin operated machines. It is the purpose of this paper to 
identify the functional requirements for subsidiary coins and to present 
specific criteria for insuring compatibility. This is facilitated by first 
discussing those factors which pertain to coin operated mechanisms. 


2.1.1 Wezght 


The published weight limits on dimes and quarters as minted by the 
U. 8. Department of the Treasury are 38.58 + 1.5 grains (2.5 grams 
nominal) and 96.45 + 8 grains (6.25 grams nominal), respectively. 
Coins in circulation naturally weigh less due to wear. To establish the 
reduction in size, a study was made of the physical properties of cir- 
culated coins which included data on changes in coin weight, diameter, 
and thickness. The sample included 1000 coins of each denomination, 
pro-rated to have representative numbers by date for the number of 
each remaining in circulation. Probability plots of the data showed that 
the population distribution was sufficiently normal to permit statistical 
estimation of the minimum and maximum values. The three-sigma 
limits, which theoretically include 99.73 per cent of the population, 
showed that the weight of dimes could range from a maximum minted 
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weight of 2.6 grams to a low of 2.31 grams, while the weight of quarters 
could similarly range from 6.44 to 5.74 grams. 

In coin equipment, the lightest coin must have sufficient energy to 
operate one or more mechanisms. Since the quarter is two and a half 
times heavier than the dime, the half dollar is five times heavier, and 
each is required to accomplish similar functions, it is safe to assume these 
requirements will be satisfied for the quarter and half dollar if they are 
satisfied for the dime. To establish the lower acceptable weight for a 
dime, coins of various weights were deposited in new and existing coin 
mechanisms. The data show that dimes weighing less than 1.98 grams 
begin to malfunction and cause misregistration of coin deposit informa- 
tion. 

The wear study showed that a minted dime of 2.50 grams could wear 
to a lower expected weight of 2.31 grams, experiencing in circulation a 
maximum weight loss of 0.19 grams. Assuming the same wear char- 
acteristics for an alternate metal, the minimum minted weight of a 
new dime should not be less than 2.17 grams to avoid encountering 
worn dimes as light as 1.98 grams. The 2.17 gram dime weighs approxi- 
mately 87 per cent of the present nominal minted value of 2.5 grams. 

As previously mentioned, the weights of new quarters and half dollars 
are not critical for coin mechanism operation. Since most metals from 
which coins might be made have densities which differ from coin silver 
by less than 20 per cent, it is not considered restrictive to limit, ar- 
bitrarily, the weights of new quarters and half dollars to 80 per cent 
of their present weights. The reason for not reducing the weight further 
is to exclude the use of aluminum and magnesium in coins because of 
the inability to separate them by eddy current means. This will be 
discussed next. 


2.1.2 Acceptance Number 


Acceptance number is the nomenclature for a measure of a coin’s 
sensitivity to eddy current detection. The lower the number, the greater 
the sensitivity, although the relationship is not linear. 

An eddy current detector consists of a ramp on which a coin, or coin 
substitute, can roll through a magnetic field as shown in Fig. 1. When a 
coin rolls through the magnetic field, each incremental mass of the ma- 
terial instantaneously rotates about that point on the coin periphery 
which touches the ramp, and each cuts the transverse magnetic flux 
lines with some instantaneous velocity. By the “generator” rule, eddy 
currents flow in the coin perpendicular to the instantaneous velocity 
vectors. The lower the resistivity of the material, the greater are the 
eddy currents for a given velocity and magnet strength. 


496 THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1966 





REJECTED | REJECTED 
ACCEPTED 


Fig. 1— Relationship of coin, ramp, and magnet in an eddy current detector. 


The induced currents flowing in a conducting material have a subse- 
quent interaction with the magnetic field and give rise to mechanical 
forces as expressed by the ‘motor’ rule. The geometry of the ramp, 
magnetic field, and rolling coin are such that the resultant of the in- 
stantaneous force vectors is in a direction opposite to the translational 
velocity of the coin and functions to oppose the velocity and retard the 
coin. The greater the eddy currents, the greater the retarding, or braking 
force, and the more the coin is slowed down. On the other hand, the 
greater the density of the coin the greater is its momentum and the 
more difficult it is to slow the coin down with a given retarding force. 

Considering the joint effect of the coin material properties of resis- 
tivity and density on eddy current detection, a decrease in the value of 
either causes increased braking. The acceptance number of any mate- 
rial is defined as the product of its resistivity and density (cp). Conse- 
quently, materials with low acceptance numbers experience more re- 
tardation and are said to be more sensitive to eddy current action. 

The equation of motion for a homogeneous conductive disc rolling 
without slipping through a magnetic field is derived in Appendix A and 
is given below: 

dx rs ke’ dx 


de cp dt a= 0, . (1) 


COMPATIBLE U. S. SUBSIDIARY COINAGE 497 


where 


= the linear progression of the center of the coin, 

= the length of time the coin experiences the magnetic field, 
total magnetic lines of flux, 

a numerical constant, 

electrical resistivity, 

density, and 

a = the constant acceleration of a coin rolling down an incline. 


8 
I I ll | 


wast 
I 


A diagram showing the physical interrelationships of the members is 
given in Fig. 1. In eddy current terminology, the coefficient of the 
velocity term (dz/dt) is called the braking constant and is designated by 
K. Thus, 


k=—. (2) 


The coin material resistivity (¢) and density (p) appear in the denomi- 
nator as a product (cp) and is part of the rationale for so defining ac- 
ceptance number. As this number decreases, the braking constant 
increases, consistent with the earlier observation that decreases in ac- 
ceptance number result in increases in coin braking. Also, consistent 
with expectation, the braking constant increases with increasing magnet 
strength. Fortunately, the acceptance numbers of metals are suffi- 
ciently different to provide a basis for achieving discrimination between 
many of the common metals and U.S. coin silver. Table I lists a number 
of metals and their acceptance numbers. While the difference in ac- 
ceptance numbers is not ideal, it is adequate, when used in conjunction 
with weighing, to permit a high degree of slug rejection. 

To illustrate how physical separation actually occurs, reconsider the 
eddy current brake previously described as a ramp intersecting a trans- 
verse magnetic field (Fig. 1). If this ramp is terminated after braking 
occurs, so that coins can fall through free space, it is apparent that coins 
which have been slowed down more will assume vertical free fall closer 
to the end of the ramp. Coins which have been slowed down less will 
have more horizontal momentum and will travel further from the end 
of the ramp before assuming vertical free fall. 

In experimental studies it was noted that the free-fall path of coin 
silver alloy is closer to aluminum than it is to zine. This does not provide 
optimum spacing for obtaining separation from both metals. Since new 
metal systems are being proposed to replace coin silver alloy, it seems 
judicious to select one with a higher acceptance number to obtain opti- 
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TABLE [— ACCEPTANCE NUMBER OF TyPIcAL METAL SYSTEMS 





Electrical 





. area Densit Accpt. No. 
Material exstcnt fnierouie a) (grams/ cin’) (nicrohmn gm/cm?) 
Aluminum 2.8 2.7 7.5 
Magnesium 4.45 1.74 7.75 
Aluminum base alloy 5.5 2.64 15.1 
Silver, pure 1.60 10.5 16.8 
Magnesium base alloy 10.0 1.78 17.8 
U.S. coin silver alloy 2.1 10.3 21.6 
40-58-2 silver-copper-zinc 2.3 9.5 21.8 
70-27-3 silver-copper-zinc 2.25 9.8 22.1 
97.5-2.5 copper-nickel 2.5 8.9 22.5 
Zine 5.92 7.13 42.2 
70-30 brass 6.98 8.4 58.6 
Nickel 6.84 8.9 60.8 
(magnetic) 
Ingot iron 9.7 7.76 75.2 
(magnetic) 
Grade A phosphor bronze 10.7 8.8 94.0 
Columbium and alloys 13.1 min 8.6 122 min 
Lead 20.6 11.2 231 
18% nickel silver 29 8.7 252 
Titanium 55 4.6 253 
Zirconium 40 min 6.5 260 min 
75-25 copper-nickel 32 8.9 285 
95-5 nickel-silicon  - 38 8.55 325 
18-8 stainless steel 79 8.0 632 
90-10 nickel-chromium 80 8.7 696 
Incoloy alloy 800 92 8.03 740 
80-20 nickel-chromium 108 8.55 925 
Nonconductors co —_ 2 











mum rejection capabilities. Experimental studies indicate that a coin 
with an acceptance number of approximately 25 gives optimum rejection 
capabilities between aluminum and zinc. It is interesting to note that 
the Mercury dime had an acceptance number of approximately 25. 
With improvements in the refining of silver, the acceptance number of 
the Roosevelt dime has dropped to its present value of 21.6. This point 
is illustrated in Fig. 2, where the acceptance curve for the Mercury 
dime is closer to that of zinc than the Roosevelt dime. From the dis- 
cussion to ensue on compatible coinage metals systems, it will become 
evident that the option to adopt a different acceptance number can be 
realized in a number of different proposals. 

Even with the acceptance number of coin silver being closer to 
aluminum than zinc, the adequacy of eddy current coin chutes for sepa- 
rating spurious slug materials is readily demonstrated. A nationwide 
sample of over 4 million nonstandard deposits in public coin operated 
equipment which did not have eddy current coin chutes was collected, 
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PER CENT ACCEPTANCE 























0.400 0.450 0.500 0.550 0.600 0.650 0.700 0.750 
QIN INCHES 


Fig. 2— Acceptance vs coin position. (Eddy current magnetic field character- 
istics: @ = 1860 lines; gap = pass 0.058 inch, stop 0.060 inch.) 


sorted and identified. Of the total, approximately 73 per cent were un- 
redeemable slugs and the remainder were tokens and foreign coins. 
Excluding the sample of tokens, foreign and mutilated coins which are 
either partly or totally redeemable, the composition of the slug sample 
is shown in Table IJ. The information is presented in categories repre- 
senting the principal separation means in coin chutes. It is seen from 
this analysis that approximately 99.8 per cent of the sample would be 
rejected in eddy current coin chutes. 


2.1.3. Diameter 


The diameters of newly minted dimes and quarters are 0.705 + 
0.003 and 0.955 + 0.003 inches, respectively. The previously mentioned 
coin study showed that the diameter of dimes in circulation ranged 
between 0.698 and 0.706 inches, while those for the quarter ranged 
between 0.946 and 0.957 inches. In many coin mechanisms, diameter 
gauging is the principal means of separation. 

Further, discrimination against foreign coins with compatible alloys 
is achieved primarily on a diameter gauging basis. In view of these 
factors and the tremendous amount of coin handling, sorting, storing, 
and dispensing equipment which would have to be changed, it is not 
advisable to change the present minted diameters. 
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TaBLE II — MATERIAL Composition Stupy oF NONSTANDARD DE- 
Posits COLLECTED IN PusBiic CoIN-OPERATED MECHANISMS Not 
Usine Eppy Current CoIn CHUTES 





No. 
Nonstandard | Per Cent Of Total 
Pieces 


I. Would be rejected by eddy current 
coin chutes: 
Removed by magnetic trap: 
Iron 242,561 
Magnetic stainless steel 3,444 


246 ,005 246,005 | 46.65 





Removed by weighing: 





Aluminum 2,311 
Plastic 1,705 
4,016 4,016 0.76 
Removed by sizing: 92,041 | 17.46 
Removed by eddy current rejection: 
Lead 12,326 
Brass 99,788 
Zinc 13,522 
Nonmagnetic stainless steel (10¢ 
& 25¢ sizes) 14,011 
Tokens 5,906 
Other 38 , 628 





184,181 184,181 | 34.93 
526,243 | 99.80 99.80 





II. Would be accepted in eddy current 
coin chutes: 
Nonmagnetic stainless steel 1,036 0.20 
1,036 0.20 


527,279 100.00 








2.1.4. Thickness 


The thicknesses of newly minted dimes and quarters are 0.053 
(+0.005, —0.003) and 0.067 (+0.005, —0.003) inches, respectively. 
By the same criteria adopted for establishing minimum and maximum 
weights and diameters, the thicknesses of dimes in circulation range 
from 0.042 to 0.054 inches while those for quarters range from 0.053 to 
0.072 inches. 

In general, the thickness dimension is not a good measure of a coin. 
During use, mushrooming of the rim can cause increases in thicknesses; 
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in other cases, wear on the rim causes reduction in thicknesses. Further, 
many different foreign coins have substantially the same thicknesses. 
Thickness gauging is primarily useful in detecting mutilated coins, but 
is otherwise rather insensitive for regular gauging. 

Although the coin gauge of the present coin mechanisms provides 
thickness gauging, the control is not so close that a small increase in 
thickness could not be tolerated. This may be necessary to help main- 
tain the coin weight in a substitute metal system. Coin silver alloy has a 
density of 10.3 grams/cm’ while most of the metals which might feature 
in a new system have lower densities. Two principal contenders, copper 
and nickel, each have nearly equal densities of approximately 8.9 grams/ 
cm’, Especially in the case of the dime, some change will have to be 
made to maintain the required weight. 


2.1.5. Wear 


Due to the nature of coin handling, coins wear primarily in thickness. 
Pearson product-moment correlations obtained from a sample coin 
study on coin wear indicates that there is a +0.46 correlation between 
coin weight and thickness, and practically no correlation between coin 
weight and diameter for coins in circulation. Because the wear occurs 
principally on the coined surfaces, it is apparent that a percentage 
change in coin thickness does not produce the same percentage change 
in coin weight as would be experienced with a solid disc. Over the range 
of expected coined surface wear, the loss of weight is estimated to be 
approximately one third of what would be experienced with a solid disc 
having the same thickness reduction. This was determined by compar- 
ing the coin weight losses with changes in coin thicknesses. 


2.2 Criteria for Compatibility 


From the previous section on factors pertaining to coin operated 
mechanisms, it is possible to identify a set of numerical constraints on 
coinage which forms the basis for an objective definition of compatibility. 
These constraints call for maintaining the present diameter, establishing 
a tolerance range for the acceptance number, and specifying values for 
the minimum weight and maximum thickness. The controlling physical 
dimensions for material and compatibility with U. 8. dimes, quarters, 
and half-dollars are given in Table III. 


2.3 Laminated Coinage 


From the previous discussion on the relative uniqueness of the ac- 
ceptance numbers of various metals, it is not surprising that the number 
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TaBLeE IJ] — PuysicaAL CHARACTERISTICS FOR COMPATIBLE 
U.S. SussipiAry COINAGE 











10¢ 25¢ 50¢ 
Diameter (inch) 0.705 + 0.003 | 0.955 + 0.003 | 1.205 + 0.003 
Acceptance number (mi- +1 +1 +1 
crohm-gm/cm?) 25 "3.5 25 3.5 2535 
Weight (grams) 2.17 min 5.00 min 10.0 min 
Thickness after coining (inch) | 0.061 max 0.072 max 0.091 max 





of existing alloys which satisfy the eddy current considerations is lim- 
ited. When additional requirements like appearance, corrosion, and 
manufacturability, to be discussed later, are added, the number of 
possible choices is even more restricted. In view of this, attention has 
been directed to combining various metals in a laminate structure and 
utilizing the different material properties to achieve the equivalent 
acceptance number of coin silver alloy. One can get an intuitive under- 
standing of the approach by considering a low resistivity metal, which 
permits high eddy currents and provides braking action to oppose the 
coin motion, laminated to a high density metal which provides inertia 
to assist the coin motion. By establishing the correct laminate thickness, 
it is possible to control the resulting acceptance number. This approach 
has the additional attraction of using common metals in an uncommon 
way to offer protection against counterfeiting. 

The equation of motion for a laminated coin rolling without slipping 
through a magnetic field, given below, was derived and experimentally 
verified as presented in Appendix A. 


EC) 
ax E® t=1 Oj a = 9= 0, (3) 


> (ae 


where 


7; = thickness of the 7th lamella, 
o; = resistivity of the 7th lamella, 
p; = density of the 7th lamella, 

n = number of lamellae, 


and all other terms are the same as defined in (2). 
A comparison of this equation with that for the homogeneous coin 
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shows that it differs only in the braking constant. Since the two equa- 
tions have the same form, they have the same general solution. If the 
two braking constants were constrained to be equal, the equations would 
have the same specific solution and the velocities of the homogeneous 
and laminated coins leaving the magnetic field would be equal. This is 
the condition for compatible eddy current materials. When the two brak- 
ing constants are equated and all common terms are cleared, the fol- 
lowing equation results and provides the constraint relation on the sec- 
tion thicknesses (7;) of the laminated coin to achieve the equivalent 
acceptance number (cp) of a homogeneous coin. 


n 
2 PiTi 


(oper = * aeoes (4) 


i=1 Oj 





This equation has been verified experimentally and used in the design 
of the laminated coins proposed in a later section for compatible coin 
substitutes. 


2.4 Additional Coinage Considerations 


2.4.1. Scrap 


One important consideration in the production of coins is the efficient 
utilization of scrap. Present punch press operations produce a material 
stock skeleton containing 30 to 35 per cent scrap. This is reclaimed by 
melting, casting, and rerolling. The same procedure can be employed 
for any single element or alloy substitute. 

In the use of laminate strip stock, the scrap reclamation problem 
places two constraints on the choice of the metals for laminate sections. 
First, when laminated scrap is melted, all of the constituent metals in 
the various laminate sections appear in the resultant alloy. Because 
refining is too costly, it is necessary to be able to enrich the reclaimed 
stock by adding one or more of the constituent metals to produce an 
additional quantity of one of the required laminate alloys. Second, in 
the enriching process it is necessary that the reconstructed alloy be 
produced in quantities that can be completely used to prevent stock- 
piling. . . 

These two constraints lead to two conclusions. First, the constituent 
elements used in one laminate section must appear in the second lami- 
nate metal which is planned for reconstruction. Second, the percentage 
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of the reconstructed laminate used in the coin must exceed the percent- 
age of scrap produced. In the simplest case, should one laminate be a 
pure metal, like zinc, the other laminate material must be a zinc bear- 
ing alloy and must constitute at least 35 per cent of the coin volume. 
The specific details of these constraints will be discussed for the proposed 
laminate metal systems in Section ITI. 


2.4.2. Composition 


From an eddy current view, the metallic elements must work into 
some metal system which achieves the required acceptance number and 
remains essentially non-magnetic. Should the proposed metal system 
require alloying, some of the most promising pure metals like silver and 
nickel cannot be used together except in very small quantities because 
of the tendency to crack during rolling. Lead is excluded in laminated 
structures as a high density replacement for silver because it fails to 
bond. The phase relationships of copper-nickel and copper-aluminum 
alloys cause the resistivities of both to rise radically above the resistivity 
of either pure metal and limit their utility. 

In addition to the scrap problem previously discussed and the con- 
siderations of corrosion, coinability, and color, to follow, the materials 
should be in good supply at reasonable cost. Zinc, copper, and nickel 
are all attractive in this regard. 


2.4.3. Coinability 


Two of the principal factors which contribute to the appearance of a 
coin are the amount of relief and sharpness of the coined surfaces. By 
these criteria, American coins rank among the finest in the world, and 
there is strong interest in maintaining the present high standard. To 
achieve this objective, it 1s necessary to employ a mctal system which 
has an adequately low initial hardness and which will not work-harden 
during coining by an amount which would prevent complete filling of the 
characters on the coined surfaces. 

Presently, coin blanks have to be capable of having an initial hardness 
of 28 to 29 Rockwell B in the annealed condition and not rising above a 
hardness of 76 to 77 Rockwell B in the fully-coined condition. This places 
a double constraint on the selection of coin substitute metals. 

Alloys which might be considered for laminated coins, such as stain- 
less steel, silver-copper-nickel alloy, copper-zinc, and silver-copper-zinc 
alloys having more than 30 per cent zinc, are too hard in the annealed 
condition for coining. Other choices for coin substitutes which are coin- 
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able like the silver-copper and copper-nickel alloys, change rapidly in 
hardness with the addition of certain trace elements used to control 
the alloy resistivity. Specifically, the addition of 0.1 per cent silver to 
copper-nickel alloys raises the hardness from 26 Rockwell B to about 
60 Rockwell B and 1 per cent silver raises the hardness to 70 Rockwell 
B. Such factors are important with respect to both manufacturability 
and acceptability. 


2.4.4. Corrosion 


A further constraint is placed on the choice of materials by corrosion. 
Any metal system which is acceptable for subsidiary coinage must nei- 
ther tarnish in a manner to change appreciably the appearance of the 
coin after minting, nor produce corrosive residues which could soil the 
property of a possessor, or become dislodged in coin mechanisms and 
eventually cause malfunctions. 

In the case of laminates, additional care must be exercised to insure 
that the adjacent laminates are adequately close in the electromotive 
series to minimize rim corrosion. In the use of dissimilar materials, it is 
to be expected that some corrosion will occur. However, the effects of 
this at a low level are adequately offset by the effective polishing which 
coins experience in use. 


2.4.5. Color 


Since the coin silver alloy used in higher value subsidiary coinage has 
been ‘‘white,” or “silvery,” since the first coinage act in 1792, the Ameri- 
can public is accustomed to white coins. How far a new metal yan 
can depart from this standard is a subjective question. 

Silver-copper alloys with decreasing silver content take on a pro- 
gressively increasing pink tone which finally becomes a rich bronze color. 
A number of independent subjective evaluations set a lower silver con- 
tent limit, concluding that at or above a 70 per cent silver content, the 
alloys look subjectively similar to the present coinage. The American 
public has also grown accustomed to the material appearance of the U.S. 
five cent coin which is 75-25 Cupro-Nickel. While this alloy has a differ- 
ent ‘‘whiteness,” it is apparently acceptable for a non-silver standard. 
It is felt that any new metal system should retain white coined surfaces. 


III. METAL SYSTEMS FOR COMPATIBLE COINAGE 


This section includes five proposals for metal systems which provide 
eddy current compatibility with existing U.S. silver alloy coinage. One 
basic alloy and four laminate solutions are presented. 
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3.1 Alloys 


In surveying the extensive number of existing alloys for possible 
compatible systems, the most severe requirement — correct acceptance 
number — was initially used. This criterion was applied to a listing of 
alloys presented in The Bureau of Standards Publication, The Mechanical 
Properties of Metals and Alloys. Alloys found which satisfy this require- 
ment are: 


(¢) aluminum alloys 
(7) magnesium alloys 
(iit) copper-zine alloys with less than 5 per cent zinc 
(iv) copper-nickel alloys with less than 3 per cent nickel 
(v) silver-copper alloys 
(vt) silver-copper-zine alloys with less than 5 per cent zinc. 


The copper-aluminum alloys are excluded here because resistivities of 
the alloys are high, yielding unacceptably high acceptance numbers. 

There are other criteria which further reduce the possibilities for 
acceptable alloys. The aluminum and magnesium alloys do not have 
sufficient weight to dependably rotate the coin separating devices in 
existing coin operated mechanisms. The copper-zine and copper-nickel 
alloys are coppery in color and tend to tarnish. Therefore, these alloys 
are eliminated from further consideration. 

The silver-copper alloy meets the weight requirement for all percent- 
age compositions. Fig. 3 shows the variations of resistivity-density, and 
acceptance number for different amounts of silver. The acceptance num- 
ber is everywhere between 21.5 and 23 microhm-gm/cm? over the range 
of 26-74 to 94-6 silver-copper. As previously discussed, the color of the 
silver-copper alloys changes from a bright, shiny white as the silver con- 
tent is reduced, requiring that all alloys having less than 70 per cent 
silver be eliminated because of the color requirement. 

Although zinc is added as a whitener, the silver-copper-zine alloys 
have much the same properties as silver-copper alloys. Since its corro- 
sion properties are the same, the silver-copper-zine series must contain 
at least 70 per cent silver to prevent corrosion. Therefore, the only 
compatible alloy system found is comprised of silver-copper alloys 
which contain more than 70 per cent silver. 


3.2 Laminates 


A laminated coin has a more complex set of constrains than an alloy 
coin as noted in the coin considerations previously reviewed. A large 
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Fig. 3 — Resistivity and density of silver-copper alloys. 


number of metal systems were investigated, and this section presents 
those few combinations which satisfy all of the restrictive conditions. 

In the interest of achieving the simplest coin to fabricate, each lami- 
nated coin to be presented consists of three lamella sections. In each 
case, the two outside lamellae are white alloys to satisfy the color re- 
quirement, and are the same thickness to insure symmetrical eddy cur- 
rent performance. The core in each case is a copper alloy. 


3.2.1 70-30 Silver-copper sections laminated on a 80-70 silver-copper 
core. (Representative) 


In the discussion on alloys it was mentioned that all silver-copper 
alloys with silver contents between 26 and 94 per cent have acceptance 
numbers exceeding 21.5. The lower limit on silver content was set at 
70 per cent for reasons of color and corrosion. This produced a coin 
material with an acceptance number of 22.5 which contained 70 per 
cent silver. 

A laminated coin could be made with 70-30 silver-copper alloy on the 
outside layers for reasons previously cited, and reduced silver content 
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alloy used in the core to achieve an over-all lower silver content in the 
coin. 

The lower limit for the over-all silver content is determined by the 
minimum acceptance number. Assuming a dime coin blank thickness of 
0.039 inch and minimum practical thicknesses of 0.007 inch for each 
outside laminate, the lowest silver content which can be used in the core 
and maintain an acceptance number of 21.5 is 20 per cent. This gives 
the coin an over-all silver content of 37.9 per cent. 

Using this approach, it is possible to obtain a family of coins by 
letting the core contain higher silver than the 20 per cent minimum. 
For purposes of illustration, if a coin bearing 50 per cent silver were 
required, Fig. 4 would apply. To illustrate its use, if it is given that 
the outer laminates should be 70-80 silver-copper, the designer is free 
to select either the section thicknesses or the per cent silver in the core. 
For example, if the center core thickness is desired to be half of the total 
thickness, the core must contain 30 per cent silver. In this series of pos- 
sible coins, there is no scrap problem since the melted scrap can be 
divided into two parts and each enriched to the desired value. 


90-10 Ag-Cu 
OUTER LAMELLA 
\ 


a 
aN 
80-20 Ag-Cu_ 

OUTER LAMELLA 7 





“~-__ 70-30 Ag-Cu 
OUTER LAMELLA 











REPRESENTATIVE 
SAMPLE 








ie) 20 40 60 80 100 
PER CENT SILVER IN CORE 


Fig. 4— Laminated coin proportions. (Ag-Cu, Ag-Cu, Ag-Cu laminate. 50 per 
cent silver.) 
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3.2.2 70-27-38 Silver-copper-zinc sections laminated on a 90-10 copper- 
zinc core 


The second laminated coin to be considered consists of two 70-27-3 
silver-copper-zine outer sections laminated on a 90-10 copper-zine core 
and contains 44.5 per cent silver. The addition of zinc to copper in a 
ratio of one to nine increases the acceptance number to 34.5. The pres- 
ence of zinc in the core complicates the reclamation problem by re- 
quiring that zine also appear in the outer laminate material as discussed 
earlier in the section on scrap. Further, it needs to appear in the ratio 
of one to nine with the copper to simplify the melt-enriching process 
without causing stock piling. Alloys of 90-9-1 and 80-18-2 silver-copper- 
zine were tried in addition to the proposed 70-27-3 composition. All work 
equally well from an eddy current view, but the 90 per cent silver alloy 
definitely leads to a scrap build up and the 80 per cent silver alloy situa- 
tion was marginal. With the 70-27-3 alloy in the outer laminates and 
the 90-10 copper-zine core, a coin is achieved which is completely com- 
patible when the section thicknesses are properly controlled. 

To visualize the effect of section thicknesses on the acceptance num- 
ber, one might imagine the center thickness becoming increasingly thin, 
in which case the acceptance number would equal that of the outside 
material. Conversely, with increasing thickness of the inside section, 
the acceptance number approaches that of the center laminate. Depend- 
ing on the thicknesses of the sections, the acceptance number of the coin 
could assume any value between the two limiting values of the constit- 
uent laminate alloys. To achieve the desired acceptance number of 25, 
it is obvious that one laminate material must have an acceptance num- 
ber which is lower and the other material one that is higher. Obviously, 
as the outside laminates wear, the influence of the inner laminate in- 
creases and the composite acceptance number shifts towards that of the 
core material. 

To determine what the section thicknesses should be to obtain a de- 
sired acceptance number, use is made of (4) which expresses the con- 
straint relation. Since each coin consists of only two laminate materials, 
and since the cut blank thickness for a coin is constant (to), it is conveni- 
ent to express the thickness of one laminate in terms of the other. In 
this case where the core material has the higher acceptance number, 
the inner laminate thickness is called t2 , and (4) takes the form 


Gp [er (to — te) + pate] 
E — te) 4. 4 (5) 


Oo, On 
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where 


pi = density of the lower acceptance number material, 

px = density of the higher acceptance number material, 
oy = resistivity of the lower acceptance number material, 
resistivity of the higher acceptance number material. 


OH 


Since (5) is a function of only one independent variable, te , it is possible 
to plot a curve for the acceptance numbers which result from varying 
to. The resulting design chart is shown for this metal system in Fig. 5. 

Since the cut blank thicknesses (to) for dimes, quarters, and half 
dollars are different, three separate design curves are shown for the three 
coin denominations and labeled tp) = 0.039, tp = 0.052, and t) = 0.066, 
respectively. To obtain a coin acceptance number of 25, one has only 
to read the core material thicknesses, t2 , from the chart. The laminate 
thicknesses for the three denominations are given in Table IV. 

Before leaving the design chart, it is appropriate to consider the change 
in acceptance number with wear. As a laminated coin wears, the outside 
sections are reduced in thickness while the core thickness is unchanged. 
This alters the ratio of the original thicknesses and causes a shift in 
the acceptance number. As the lower resistivity material on the out- 
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Fig. 5— Laminated coin proposal no. 2. (70-27-3 Ag-Cu-Zn [R = 2.25 x 10° 
ohm-cm, D = 9.8 gm/cm*] and 90-10 Cu-Zn [R = 3.92 ohm cm, D = 8.8 gm/cm’].) 
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Tas_Le IV 
Coi 70-27-3 90-10 70-27-3 
O1p. Silver-Copper-Zinc Copper-Zinc Silver-Copper-Zinc 
ty hh 
— fe (center) — 
2 * 2 
(in) (im) (in.) 
10¢ 0.0125 0.014 0.0125 
25¢ 0.0165 0.019 0.0165 
50¢ 0.0207 0.0246 0.0207 








side wears away, the acceptance number increases. Conversely, the 
acceptance number decreases if the denser material is used on the out- 
side and wears away. To show this effect, a wear curve was constructed 
by holding the core thickness constant and calculating the acceptance 
number for reduced clad thicknesses. 

Referring to the wear curve of Fig. 5, the ratio of combined outer 
laminate thicknesses to inner laminate thickness, 4/t., is 1.785 for a 
newly minted 10-cent coin and this value on the wear curve coincides 
with an acceptance number of 25. From the study on coin wear, the 
maximum reduction in dime thickness was estimated to be 0.012 inches. 
Recalling that the weight decreased at one-third the rate of the thick- 
ness reduction, the maximum dime wear would result in the equivalent 
solid disc loss of 0.004 inch total, or 0.002 inch per side. Since this wear 
occurs on the two outer laminate sections, it reduces the combined 
outer section thicknesses ¢; to 0.021 inch, and the t/t. ratio to 1.50. 
Referring to the wear curve, this change would cause the acceptance 
number to rise to 25.2, which is an insignificant change. The wear 
limits for the dime and quarter acceptance number change differ as a 
function of the relative amount of wear and the initial thickness ratio. 
No wear curve is given for the 50-cent piece since wear data are not 
available. 

As mentioned previously, the coins under discussion here contain 
approximately 44.5 per cent silver. The specific compositions of the 
three subsidiary coins are given in Table V. 

The following calculation shows that there is no scrap build up with 
these coins. Assuming there is 100 pounds of laminate material before 
the coins are blanked, there will be 35 pounds of scrap to be melted. 
For the dime, this 35 pounds of scrap will consist of 15.75 pounds of 
silver, 17.32 pounds of copper, and 1.93 pounds of zine. To achieve a 
new alloy in the same proportions as the outer lamella (70-27-3) only 
silver needs to be added to the melt, since the copper to zinc ratio re- 
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TABLE V—CoINn CoMPOSITION 











10¢ 25¢ 50¢ 
Q Silver 45 44.4 44.0 
% Copper 49.5 50.0 50.4 
% Zine 5.5 5.6 5.6 
Total 100.0 100.0 100.0 








mains nine to one. The weight of outer material obtained from the 
addition of silver is found from the following ratio 


_ (% Copper in melt) (35 lbs scrap) _ 17.382 _ 


eee SFC = 64.2 Ibs. 
ue % Copper in outer layer 0.27 RS 
The weight of the outer lamella in 100 pounds of laminate is 
_& _ 0.025 _ 
Wo = é xX 100 = 0.039 (100) = 64.2 Ibs. 


Since the two weights are equal, there will be no scrap build up. 


3.2.3 40-50-5-5 Silver-copper-nickel-zinc sections laminated on a copper 
core 


The third proposal for a compatible metal system consists of two 
outer laminates of 40-50-5-5 silver-copper-nickel-zine alloy bonded to a 
core of copper. The 40-50-5-5 silver-copper-nickel-zine alloy is com- 
monly known as Swedish Coin Silver. It is used in five Swedish coins, 
has an attractive white appearance and has fair coinability. As an alloy, 
the material has an acceptance number of 57.2 which is too high. As an 
outside lamella of a laminated coin with a copper core, it is an attractive 
material proposal for achieving an acceptance number of 25. The coin 
contains 19.9 per cent silver. 

The design and wear curves for this coin were constructed as in the 
case of the previous coin, and are plotted in Fig. 6. In this case, since 
the material with the lower acceptance number is used in the core, the 
curve is plotted with the thickness, t; , as the independent variable. 

The coin meets all of the cited requirements and is attractive as a 
coin of low silver content. The section thickness for each denomination 
is given in Table VI with a listing of the various coin compositions. 


3.2.4 75-25 Copper-nickel sections laminated on a copper core 


This fourth proposal consists of two 75-25 copper-nickel outer sections 
laminated on a copper core and contains no silver. The 75-25 cupro- 
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Fig. 6— Laminated coin proposal no. 3. (40-50-5-5 Ag-Cu-Ni-Zn [R = 6.05 
ae ony D = 9.45 gm/cem*] and Cu [R = 1.74 X 10° ohm-cm, D = 8.94 
gm/em*}. 

















TasBLe VI 
40-50-5-5 Silver- 40-50-5-5 Silver- 
Copper-Nickel-Zinc Copper Copper-Nickel-Zinc 
Coin A ta (center) A 
2 (in.) "2" eae 
(in.) (in.) 
10¢ 0.0097 0.0196 0.0097 
25¢ 0.0132 0.0256 0.0132 
50¢ | 0.0167 0.0326 0.0167 
COIN COMPOSITION 
10¢ 25¢ 50¢ 
g, Silver 19.9 20.3 20.2 
% Copper 75.1 74.6 74.8 
% Nickel 2.5 2.55 2.5 
% Zine 2.5 2.55 2.5 





Total 100.0 100.0 100.0 
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nickel is presently used in the coining of U.S. five cent pieces. It is an 
easy material to fabricate, is adequately white and has an acceptance 
number of 286. Because of this high value, it must be fabricated with 
thicker sections of a core material with a low acceptance number to 
achieve the desired acceptance number of 25. The acceptable composi- 
tion percentages are given in Table VII. 

The loss of outer Jaminate thickness due to wear has a more pro- 
nounced effect on the change in acceptance number than for the other 
laminates. Because of the faster relative change of the thickness ratio, 
the acceptance number is reduced after maximum wear to 22.5. This is 
not considered objectionable because the acceptance number remains 
within the proposed limits for compatible coinage. Experimental results 
with this coin composition confirmed the expectation that it yields 
better rejection than does present coin-silver. 


IV. SUMMARY 


After examining the properties of U. 8S. silver alloy coins and the 
operating requirements of typical coin handling mechanisms, it was 
concluded that the coin diameters should not be changed. The dime, 
quarter, and half-dollar minimum newly minted weights should be 2.17, 
5.00, and 10.0 grams, respectively. The coined thicknesses of the quarter 
and half-dollar should be retained, and the dime thickness could be 
increased to 0.061 inch maximum to more easily achieve the weight ob- 
jective. 

Further, an acceptance number of 25 with a maximum and minimum 
limit of 26 and 21.5, respectively, would permit a distinct improvement 
in the ability to reject slugs. 

It is possible to design a number of substitute metal systems which 
have eddy current compatibility with U. 8. silver alloy coinage. After 
identifying the considerations which enter into the design of compatible 
coinage, five proposals have been made which satisfy these conditions. 
They differ in silver content from a high of 70 per cent to a low of 
zero, offering freedom in the final selection. 


TasBLE VII— Corn ComposIrion 





10¢ 25¢ 50¢ 
% Nickel 9.9 10.1 10.2 
% Copper 90.1 89.9 89.8 


Total 100.0 100.0 100.0 
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TaBLE VIII — CompatTisBLE CoIn PROPOSALS IN ORDER OF MERIT 





% Nominal 








Accpt. No. f : 
Order of Merit Metal System pee After Max: Advantage Os a 
inlimum 

1. 70-27-3 Silver-Copper-Zine Laminates 25.0 25.2 7.7% 
on 90-10 Copper-Zine 

2. 40-50-5-5 Silver-Copper-Nickel-Zine 25.0 23.0 3.8% 
Sections Laminated on a Copper Core 

3. 75-25 Copper-Nickel Sections Lami- 25.0 22.5 0.7% 
nated on a Coppor Core 

4, Silver-Copper Alloys With at Least 70 22.5 22.5 12.9% 
Per Cent Silver 

5. 70-30 Silver-Copper Sections Lami- 22.1 21.6 9.7% 


nated on a 30-70 Silver-Copper Core 





From a coin-operated mechanism point of view, there are three 
factors which provide a basis for comparing the five material systems: 
weight, acceptance number, and resistance to changes in acceptance 
number with wear. Because the minimum weight requirement is achieved 
with all five proposals, the ranking is based principally on the ability 
to obtain an acceptance number of 25 and to resist changes therein with 
wear, and secondarily, on the ability to obtainas much dime weight above 
the minimum as possible. Judging by these standards and using the 
dime data as a basis of comparison, it is possible to rank the five com- 
patible coin proposals in an order of merit as listed in Table VIII. 


V. ACKNOWLEDGMENTS 


These proposals for compatible substitute coinage and the supporting 
technical information were submitted to Miss Eva Adams, Director 
of the U. 8. Mint. 

Appreciation is expressed to members of the U. 8. Mint, the Battelle 
Memorial Institute, the Metals and Controls Division of Texas Instru- 
ment Company, and the Bell Telephone Laboratories’ Metallurgical 
Engineering Department for their cooperative assistance. 


APPENDIX A 
Derivation of Eddy Current Equations 


A.1 Solid Disc 


The equation of motion for a homogeneous disc, rolling without 
slipping through a magnetic field, was derived by Messrs. L. Veith and 
C. F, Wiebusch using classical mechanics considerations and issued in 
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an internal Bell Telephone Laboratories memorandum dated Septem- 
ber, 1940. The essentials of these equations have, subsequently, been 
confirmed by derivations based on energy considerations and dimen- 
sional analysis. Because the latter technique affords a simple and 
direct approach to the equation of motion and emphasizes the important 
parameters from the onset, it is used in the following derivation. The 
authors are indebted to Mr. J. P. Runyon for suggesting its use in this 
paper. 

Consider a homogeneous conductive disc rolling down a ramp without 
slipping and cutting a transverse magnetic field. The parameters, 
governing the motion of the disc, are defined below. 

6 = angle of inclination of ramp. 
linear progression of the center of the disc. 
magnetic lines of flux of the transverse magnetic field. 
density of the disc. 
electrical resistivity of the disc. 
initial acceleration of the disc. 


I 


| 


I 


aqrves 
ll 


Il 


The basic equation of motion for the dynamic system represented 
here is of the form 
dt 4 Ke + Ky = 0 (6) 
de 1 Ae 2 3 = U. 
An inspection of the equation shows that the constant, K;, must have 
the dimensions of an acceleration, hence it is the initial acceleration, a, 
of the disc which is opposite to that due to the eddy current action. 
Further, since the disc is moving through a constant magnetic field, 
it is assumed that the motion is not a function of the displacement, x, 
and that the coefficient Ke is zero. These considerations reduce the form 
of the equation to 
dx 
dt? 
Since the first and third terms have the dimensions of acceleration, 
[LT~*|, where L is length and T' is time, the second term must also have 
the same dimensions. Further, since the dz/dt part of the second term 
is a velocity with the dimensions of [L7'—'], the coefficient Ki must have 
the dimensions of [7-']. In eddy current terminology, the coefficient of 
the velocity term, Ki, is known as the braking constant. 
From physical considerations, the braking constant is a function of 
the magnet and disc properties, specifically, the magnetic lines of flux, 
disc resistivity and density. This can be expressed in the form 


Kia ¢’p'a. (8) 


dx _ 
+h -a=0. C7) 
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Since ¢ has the dimensions of [J7Z?/TQ] where M represents mass units 
and @ represents charge units, p has the dimensions of [1//L'], o has 
the dimensions of [ML*/TQ?] and knowing that their product must have 
the dimension of [7], (8) can be written in the form 


H-(IETEST- 


B = MEM PENN 7 25-3743), (10) 


Equating exponents on both sides of the equation leads to the following 
four equations: 


Recombining 


—-§6-vA=-l (11) 
Tale dhe ee eae (12) 
—6— 2. =0 (13) 
26 — 3y + 3A = 0. (14) 
From the simultaneous solution of (11), (12), and (13), 
A= -l 
6 = 2 
y= -1. 


Substitution in (8) yields the braking constant 


2 


ky a a ° 
op 
Therefore, 
2 
Ky = k a ? (15) 
op 


and from (7) and (15), the equation of motion for a homogeneous dise 
rolling without slipping through a magnet field is 
dx dx 
—+k———a=0. 1 
dt? op dt : (16) 
This equation assumes that the entire face area of the conductive disc is 
uniformly influenced by the magnetic field. In practice, this is rarely the 
case. Consequently, the braking constant is typically modified to account 
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for the area and position of the disc relative to the area and position of 
the magnetic field. In addition, the edge effect of the dise entering and 
leaving the field must be considered. These factors are generally deter- 
mined from geometric considerations and empirically obtained results 
in specific applications. In no event do these modifying coefficients 
reduce the generality of the basic equation of motion, given in (16). 


A.2 Laminated Disc 


The equation of motion for a laminated disc, rolling without slipping 
through a magnetic field, has the same basic form as that for a solid 
disc. An examination of (16) shows that all of the dise parameters appear 
in the braking constant. The specific form of the equation of motion 
for the laminated disc can be obtained by substituting appropriate 
expressions for the laminated disc resistivity and density in the braking 
constant. To facilitate this step, it is convenient to express the braking 
constant in terms of the dise mass and resistance. 

The resistance of the conductive dise to eddy currents can be obtained 
from the general resistance equation 


al 
ae (17) 


With the coin center velocity parallel to the ramp and the magnetic 
field transverse to it, the direction of current flow is normal to the ramp 
along conductive length 6; , the height of the disc equal to the magnet 
dimension normal to the ramp. The cross-section area normal to the 
current is the product of the disc thickness, 7, and the width of disc 
equal to the length of the magnet, parallel to the ramp, b2. Hence, 

ea (18) 
T be 
The density of the solid disc is simply p = mass/volume = m/A,r. 
Substituting these expressions into the braking constant of (15) yields 


x = — et LA 
. (Fem) (2) ~~ Rmbs * (19) 
bi Ajt 


To obtain the appropriate expressions for the mass and resistance of a 
laminated coin, one can immediately write the mass equation 








my = A-(piti1 + pote + *°* Patn) = A.D (pits) (20) 


where A, = face area of the coin. 
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The total resistance of the laminated dise consists of the resistances 
of each lamination added as parallel resistors. 








di 1 
| a a CET 
fe. a 1 1 (21) 
me ee oe, ae, 
where 
— oxb1 
BUS Tide 
b, 1 
hee = — 
"be ye (22) 
i=1 O74 


Substituting in (19), the braking constant for a laminated disc takes 


the form 
ke > (“) 


o% 


Kiein = — — (23) 
X, (pits) 
Hence, the equation of motion for the laminated disc is 
. wee) 
ug SN 2%) (24) 


dt? dt 


> (p:T:) 


A.3 Equivalence of Solid and Laminated Discs 


Since the equations of motion for the homogeneous and laminated 
discs have the same form, they have the same solution and will differ 
in specific trajectories as a function of the braking constants only. The 
functional equivalence of the homogeneous and laminated discs can be 
obtained by equating the braking constants of (15) and (23), yielding 


2 = Ti 
we (EE) 
ap ” , 

Dy Pits 


i=l] 


(25) 
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Equation (25) reduces directly to the desired constraint relation. 


= t=1 C7 ; (26) 





Equation (26) is in terms of the physical and material properties of the 
coins only, and establishes the equivalence between the two coins. 
While the resistivity-density product of a homogeneous disc is independ- 
ent of thickness, it is readily seen that the value for the laminated coin 
is not. The expression correctly reduces to an identity for the case of 
a single laminate. 

Putting (26) in a slightly different form, 


n 
Dy pits 
i=1 


(op)err = — mus (27) 





i=1 O07 


it is apparent that the effective resistivity-density product for any 
laminated coin can be calculated with this equation. 


AA Experimental Verification of Equations 


To verify the equivalence of the laminar resistivity-density product, 
it was necessary to fabricate laminated coins using (27) and to test their 
dynamic performance against homogeneous coins of known parameters 
in an eddy current coin-sampling device. For this purpose, it was de- 
cided to build a laminar coin to simulate the resistivity-density product 
of zinc. Zinc was selected because it represents the next highest accept- 
ance number above coin silver that the chute normally rejects. 

The resistivity of zinc, determined with a Magnaflux Conductivity 
Meter (Model FM-100), is recorded in Table IX, along with the meas- 
ured density. The acceptance number of this material is given as 42.3. 


Taste IX 
Paes : Acceptance Number 
* Resistivit: D 
Material x 10-* alii cm alan x 10-8 ane phan ene 
Copper 1.74 8.86 15.4 
Zine 6.0 7.06 42.3 
Phosphor Bronze (Grade A) 9.0 8.86 79.7 
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TABLE X 
. Resistivity Density Thickness 
Material X< 10-6 ohm cm gm/cm3 inch cere tae) 
xX 1076 
2 2 
1 2 o1 o2 pl p2 it le ere-obaa/cu 


Phosphor Bronze | Copper 9.0 | 1.74 | 8.86 | 8.86 | 0.039 | 0.011 41.5 








An examination of (27) indicates that the two materials selected to 
simulate a given coin must have acceptance numbers on each side of the 
desired value. For this reason, phosphor bronze (Grade A) with an 
acceptance number of 79.7 was laminated with copper to produce a 
zine disc equivalent. The particular laminar thicknesses used in the 
simulated test discs are given in Table X, where ?¢2 is the thickness of 
the center laminate and t; is the combined thickness of the two, equal 
outside laminates. The tolerance on the three laminar section thicknesses 
was held to 0.0005 inch each and the resulting calculated acceptance 
number of the clad disc was within 1.9 per cent of the homogeneous 
coin value. This deviation is considerably smaller than the discrimina- 
tion capability of the eddy current test instrument. Therefore, the lami- 
nated discs were judged to be adequate test samples for comparative 
dynamic testing. 
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Fig. 7— Acceptance vs coin position. (Magnet strength: ¢ = 1800 lines; 6 = 
20°; gap = pass 0.058 inch, stop 0.060 inch; coin diameter = 0.705 inch.) 
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It should be emphasized that the acceptance number of the laminated 
coin was calculated with the equation this experiment was designed to 
verify. The 1.9 per cent agreement, arrived at above, cannot be inter- 
preted as a demonstration of equivalence. Satisfactory verification 
requires that the laminated coin exhibit the same dynamic behavior in 
the eddy current chute as the coin it was designed to simulate. 

To evaluate the eddy current behavior of the laminated coins, a stand- 
ard coin chute with the eddy current magnet adjusted to the nominal 
recommended flux strength of 1800 lines was mounted vertically. The 
accept mechanism was initially adjusted toward the front of the chute 
to insure that all deposited coins would be rejected. By progressively 
adjusting the mechanism toward the rear of the chute and dropping 
each coin and laminate equivalent 100 times at each setting, the results 
were obtained, as shown in Fig. 7, with the per cent acceptance plotted 
as a function of the distance, d, from the end of the run. 

While there is some difference at the end points in the transition from 
low to high per cent acceptance, the mid-value acceptance characteristics 
of the solid coin and laminate disc equivalent approximate each other 
within the experimental accuracy of the test. This supports the con- 
clusion that (27) is correct and provides an equivalence between single 
laminar and multiple laminate coins. 

The end point transitional effect, referred to in the last paragraph, 
manifests itself as a sharper acceptance characteristic for the laminated 
disc. This means that the discrimination properties of this coin in eddy 
current detectors are superior to the homogeneous coin it simulates. 
Stated differently, the variation in trajectories which the laminated coin 
exhibits after leaving the run is less than for the companion homogeneous 
coin. This suggests that the laminated coin could be used not only to 
obtain a compatible eddy current coin alloy, but also, to achieve im- 
proved rejection characteristics. 
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Bounds on Communication with 


Polyphase Coding 


By A. D. WYNER 
(Manuscript received December 23, 1965) 


The theoretical capabilities of a “polyphase” coding-modulation scheme 
with additive white Gaussian noise are studied. The channel capacity of this 
system ts found and the error exponent estimated. Bounds are also found on 
Ro(pmax), the maximum (asymptotic) rate for which polyphase codes can 
be found with maximum correlation between code words pmax - 


I. DEFINITIONS AND PRELIMINARIES 


We shall consider the following (‘‘polyphase’”’) coding-modulation 
system (schematized in Fig. 1): 
Every T seconds the message source emits one of M equally likely 


messages. The information rate is R = 1/T In WM nats per second. 
Corresponding to the 7th message (¢ = 1, 2, --- , 47) the coder emits an 
n-vector X; = (a1, Vi2,°°* , in), Where 


7 Se Sr 


=> ? 


eo, 2p se (1) 


and where the integer n will be specified later. The time interval [0, 7'], 
during which this information must be transmitted, is divided into n 
equal subintervals of length 7/n. During the kth of these subintervals, 
the modulated signal is 


si(t) = V/28 cos (wt + 2x), (kb — 1) * kT 


= a Ey 
n (2) 


k= 1,2,---,n. 


Thus, we have employed phase modulation with carrier frequency w, 
radians per second and average power S. 

We assume that the noise is additive, white, and Gaussian with one- 
sided spectral density N.. The receiver must examine the received 
signal y(¢), the sum of s;(¢) and the noise, and determine which of the 
M messages was actually transmitted. It is well known that (since all 


923 
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Fig. 1 — Polyphase coding-modulation system. 







y(t) = 
S,(t)+2(t) 






MESSAGE 
SOURCE 


signals are equally likely to be transmitted, and have equal energy ST’) 
the optimal decoder (which minimizes the average error probability ) 
selects that signal s;(¢) which maximizes p;, the (normalized) correla- 
tion between s;(¢) and y(t): 


— 7 [ s:(t)y(4) dt. (3) 


Let us remark at this point that the correct operation of the decoder 
depends on its exact knowledge of the possible transmitted signals, so 
that in particular all delays and distortions to which the signals are 
subjected in transmission must be known exactly by the receiver. This 
is a so-called ‘“‘coherent”’ receiver. 

We let P.; equal the probability that the decoder output is incorrect 
given that message 7 was transmitted, so that the average error prob- 
ability is 
M 
ys P. ets (4) 


= 


P. = 


i 


_ 


Now, the same channel is to be used by a number of users simultane- 
ously, each at a different carrier frequency. Let W cycles per second be 
the separation of carrier frequencies between adjacent users (W will be 
taken as the ‘“‘bandwidth”’). Then the carrier frequency for the ath user 
(a an integer) is w = a2aW radians per second. Further, we shall set 
n = WT (let us say that 7 is such that WT is an integer), where 7 is 
the number of subintervals defined previously. With w. and n so chosen 
and the signals constructed as in (2), it is easy to show that the signals 
of the ath and 6th (a ¥ @) users are orthogonal on the interval [0, 7). 
Hence, the presence of the signal due to the @th user does not affect the 
correlator in the decoder of the ath user. 

Let us say that the transmission rate R and the bandwidth W are 
held fixed, and let 7’, the duration of the signals (hence n = WT7'), become 
large. Every 7’ seconds the message source will produce one of M = e*” 
equally likely messages to which the coder must assign an n-vector. The 
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channel capacity C is the maximum rate for which we may make P, 
vanishing small for T sufficiently large. Formally, for any R < C and 
€ > 0, there is a 7 sufficiently large so that the transmitter may transmit 
one of M = e*” messages with P, < e. (This will necessitate a set of 
M = e”“" n-vectors stored in the coder.) The channel capacity C of this 
coding-modulation scheme is found in Section IIT. 

Let us consider again the decoding scheme. Making use of the fact 
that the p; of (3) are normally distributed random variables, it is possible 
to write an expression for the error probability P.* which depends only 
on the signal energy to noise ratio ST’/N, and the matrix of normalized 
inner products among signals 


1 sf ca 
Pi = oT : s;(t)s;(t) dl, 4jJ = 1,2, ate? ,M. (5) 


From (2) we obtain 


pe cos (tix = vik); 1,J = 1,2, mre (6) 


wim 


It is known} that the error probability P,. (as given in (4)) using the 
optimal decoder may be bounded by 


P. & f(max p;;), 
ixi 
where f(x) is an increasing function of x. Accordingly, a reasonable 


procedure for designing good coding systems would be to try to make 
Pmax = Max p;; aS small as possible. Alternately we pose the problem as 
ivi 


follows: 
With W, 7, pmax held fixed, what is the largest rate for (7) 
which we can design codes with parameters W, 7’, pmax? 
Let us observe that from (6) 
pi = 1 — ps [1 — cos (ain — xy)] 
(8) 


a Sle 


gin? (tun — Xj) = d(x; , X;) 
522s 2 =e 2n 


* Ref. 1, (2.11). 
Tt Ref. 1, (4.7) and Ref. 2, p. 498. 
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where the ‘‘distance”’ d(x; , x;) is defined by 


n 2 
&(x:,x;) =>. 2 sin Ga = 2a) (9) 
k=1 2 
Thus, a code with maximum p;; = pmax , has minimum d’ (x, , x;)/2n = 
(1 — pmax). In the light of the above, we shall reformulate the problem 
as follows: 


Let @, be the space of real n-vectors x = (21,%2,°-:,2n) (where 

n = WT’) which satisfy 
—riuS 47, k= 1,2,---,n. (10) 
Let x = (a1,°°:,%,) and y = (y,°-:,Yn) €@n, and define the 


distance between x and y as 


d(x,y) = b (2 sin |, (11) 


It will be shown in Section IV that d(x,y) is, in fact, a metric. A code 
is a set of M members of @,, {x; = (ta, tin, «++, Xin) }ie1. The 
transmission rate is R = 1/n 1n M nats per symbol. The transmission 
rate in nats per second is R = (1/1) n M = WR. We will define M (n,d) 
as the maximum number of code vectors in an n-dimensional code with 
minimum distance between pairs of code words d. Then R(n,d) = (1/n) 
In M (n,d), and R(n,d) = (1/7) ln M (n,d) are the corresponding trans- 
mission rates. A problem equivalent to that of (7) is the determination 
of R(n,d). In Section IV we shall let n (and hence 7’) become large 
while the ratio 8 = d’/2n is held fixed (corresponding to a fixed pmax) 
and estimate R(8) = lim R(n, ~/2n8) by upper and lower bounds. 


Since 6 = 1 — pmax, R(L — pmax) is the (asymptotic) maximum rate 
for polyphase coding with max pi; = pmax - 
iq 


II. SUMMARY AND DISCUSSION OF RESULTS 


The channel capacity is shown in Section IIT to be 


C= w|-[ i) in) dy +n 24], (12) 
where 
A = 8/N.W (12a) 
is the signal-to-noise ratio, and 
Fle) = 2pAe *°*PT9(2pA), (12b) 
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and J, (x) is the modified Bessel function of vth order. Another formula 
for C is (93). Approximate formulas for C' for large and small values of 
the signal-to-noise ratio A are obtained in Appendix A. For large values 
of A, | 


= vIn (* 4) + (A), (13) 
where ¢:(A) > 0as A — o. For values of A close to zero 
C = WIA + 0(4’)). (14) 


The capacity C is plotted versus the signal-to-noise ratio A in Fig. 2. 
Estimates of the optimal achievable error probability are obtained in 
Appendix D. 

The upper and lower bounds on R(@) are expressed in terms of the 
function Co(é) which is defined as follows. Let & be chosen 


0<é81, 
then define \(£) as the (unique) solution of 


















































7.5 10.0 
A=S/NoW 


Fig. 2— The channel capacity C vs the signal-to-noise ratio A = S/N.W — 
(12) (solid line). (Curves A and B are the approximations to the capacity C for 
large and small values of the signal-to-noise ratio A, respectively — (13) and 
(14). Curve C is W In (1 + A), the capacity of a channel with bandwidth W and 
no restriction on the modulating scheme. 
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_T, Bate) 
p=|1 i Te |. a 


The existence (and uniqueness) of the solution to (15) is established in 
Appendix B, A graph of \(&) versus é is shown. in Fig. 3. The function 
Co (£) is then defined as 


Co(€) = —In Io(A(E)) + (1 — E)AE). (16) 


A graph of Co(é) versus € is shown in Fig. 4. Our bounds on R(8), 
which are obtained in Section IV (and plotted in Fig. 5) are 


Co(8) S RB) S Co(r’s), (17) 
where Co(é) is defined in (1) and 


(TeV B). (18) 


The lower bound is of the same type as the Gilbert bound for binary 
coding, and the upper bound makes use of the Blichfeldt density method.* 
Let us remark that the upper and lower bounds of (14) agree when 
6 = 1, yielding R(8) = 0 for 8 = 1. When B is small it is shown in 
Appendix FE that 


Y= 


ae 


1 In — + (8) = R(B) = 41n at &2(8), (19) 


1 
rep wep 

















fe) 0.25 0.50 0.75 1.0 


Fig. 3 — The function A(é) vs — — (15). 
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Oo 0.25 0.50 0.75 1.0 


Fig. 4 — The function Co(t) vs £— (16). 


where & , £2 > 0 as 8 > 0. Thus, for sufficiently small 8, R(8) is within 
4 In 2 of 4 In (2/meB). 

In terms of the modulation scheme discussed in Section I it is more 
revealing to rewrite inequalities (17) in terms of pmax the maximum 
correlation between pairs of signals. Let Ro(pmax, W, 7.) = Ro(pmax, T') 
be the maximum rate (in nats per second) attainable for the polyphase 


P wax 


0.50 
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~~ 1.0 
ed he! 


ss 
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B 








Fig. 5 — The upper and lower bounds R (8) vs 6 and pmax — 1 = 8 — (17). 
Rk (ay. lies in the shaded region. 
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modulation scheme of Section I with parameters pmax, W, and 7. Let 
Rislpiss)- = limit Ro(pmax , 7’). In the light of comment following (9), 


Rolpuss) = Rl a — Pmax)]. The upper and lower bounds on R.(pmax) 
are plotted versus pmax in Fig. 5. 

Appendix F contains a comparison of the capabilities of this polyphase 
system and another important modulation system. _ 


III. CHANNEL CAPACITY 


The signals s;(¢), 7 = 1, 2, --- , M, are of the form 


si(t) = ~V/28 cos (a2tWt+ rn), (k—- i) -<t< 
(20a) 
k= 1,2,---,n, 
where n = WT and 
—a7 SUn So, k= 1,2, ---,2. (20b) 
Alternately, we may write 
si(t) = aa” cos a2cWt + rx” sin a2r We, 
kT (21a) 


(k-Dis i k = 1,2,---,n, 


where 
tin”? = V28 cos xin, te” = +/28 sin vin . (21b) 


The noise function z(t) is a sample from a white Gaussian noise 
process with one sided spectral density N. (so that the covariance is 
R(r) = (N./2)6(r)). The received signal is y(t) = s;(t) + z(t), where 
si(t) is one of the M signals. The optimal decoder computes 


1 T 
pi = aa J si(t)y(t) de, 
and decodes y(t) as that s;(t) with largest p;. If y(t) is the received 
signal, let - (t) be 

y*(t) = cos a2r Wt + Yn sin alr Wt, 


(22a) 
(k-1)> s si<%, es eae 
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where 
kT/n 
n® = 20 | iC) cow wei ai (22b) 
(k-1)(T/n) 
and 
kT/n 
n° = ow | y(t) sin a2r We dt. (22c) 
(k-1)(T/n) 


We may think of y*(t) as the projection of y(£) onto the space of allow- 
able signals. It follows by direct computation that 


t ¢* 2 
ww | si(t)y*()dt, 


the correlation of y*(¢) and the 7th signal s;(é) , equals p; . Thus, without 
loss of generality, we may consider the received signal to be y*(t). From 
(21) and (22), it suffices to consider the noise to be 

z(t) = y(t) — s(t) = 2 cos a2tWt + 2 sin a2cWt 


(23a) 
(ai spe. Rei Aen 
n n 


where 
a? = 4? — 2g and 2° = 4° — ta”, 
(23b) 
Kees 12, oh 
From (23b), (22b), (21b), and (20a) we may write 
kT /n 
a? = aw | (y(t) — si(é)) cos on Wt dt 
(k-1) (T/n) 
kT/n (24) 
= 2W z(t) cos alxWet dt, k = 1,2,---,n, 
(k-1) (T/n) 


1L- . : . . 
so that z,."” is a normally distributed random variable with mean zero 
and variance 
kT/[n 


E(@°”) = 4W’ 
(k—-1) (T/n) E 
kT/n (25) 
| cos a2n Wt (cos a2aWr)a(hya(s) at dr, 
(k-1) (T/n) 
where the over-bar denotes expectation. Since z(#)z(7) = R(t — 7) = 
(N./2)5(t — 7), the variance of z” is N,W. Similarly for 2. 
Further, E (z(°2,”) = 0, and 


Elen, ze) = 0 (7 =1,2) if by xh. 
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Thus, these random variables are independent. 

We conclude from the above that our channel is equivalent to the 
following time-discrete memoryless channel. Every T/n = 1/W seconds, 
the channel input is a real number Xe[—7,z]. The output is a pair of 
numbers Y, and Y2 given by 


Ze => XxX + Ly - Yo = Xo + Zo (262) 
where 
Xi = V/ 28 cos X, Xe = 4/28 sin XxX, id (26b) 


and Z, , Z, are independent normally distributed random variables with 
mean zero and variance N = N»oW. Consequently, known results for 
determining capacity may be used. 

If an input probability distribution is specified, the mutual informa- 
tion of the input and the output is 


1(¥1,¥2;X) = H(¥1,Y2) — H(¥1, Y2|X), (27) 
where H(Y;, Y2) is the joint uncertainty of Y; and Ye and 
AY s Vol) 


is the conditional uncertainty of Y;, Ye given X. The channel capacity 
C, in nats per second is 


C = Wimax I(Y;, Yo; X)], (28) 


where the maximization is performed over all possible input distribu- 
tions. We proceed to find C. __ = 
Say X = a, and let 1 = +/28 cos 2, x2 = +/28 sin z, then 


H(Y; ) Yo | pe = x) 
Gin - ite (29a) 
= | . dydyog(y1 — 1, Y2 — X2) In gly: — 21, y2 — 22), 


where 


g(21, 22) = es exp [— (21 + 2:)/2N] (29b) 
Q2rN 


is the joint probability density of Z, , Z.. After changing the variables 
of integration and integrating (29a), we obtain, 


H(Y,, Y2|X = x) = In 2reN, (30) 
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independent of x. Thus, 
H(YiY2|X) = In 2reN, (31) 


independent of the input distribution. 

Thus, to find C, we must maximize H(Y1, Ye). Say po(x) is the 
probability density of the input X, and p(y, ye) the resulting joint 
probability density of the output pair (Y1, Y2). If we characterize the 
output pair by polar coordinates (@, ®), then the corresponding den- 
sity for ®, ® is 


fe(re) = rpe(rcosg,rsing), r2od, —7t SoS, (82) 


where r is the Jacobian of the transformation. Hence, 


+0 +e 
H(Y¥1Y2) - | ply, yo) In poly, y2)dyrdyr 


od Tv 
= / [ piu(7 COS y, Yr Sin g) 
r=0 “7 


‘In [pr(7 cos yg, r sin g) |r dr dy 


nae i a ful) 
-| LF poloe) nha dr dy (33) 


= [ E fo(r,e) In fi(r,¢) dr de 


i I np ; fu(rg) In r dr do 


H(@®) + i fi(r) Inr dr, 
0 
where H (@, ®) is the joint uncertainty of ®, ®, and f(r) is the marginal 
density of R. Now 
H(®,%) S H(@)+ H@), (34) 


(where H(®), H(&) are the uncertainties of ®, ®, respectively) with 
equality if and only if ®, & are independent, and 


H(@) S In 27, (35) 
with equality if and only if @ is uniformly distributed on the interval 


[—2,7]. Hence, from (83), (84), and (85), 


H(Y1,Y2) S H(Q) +In2er + [ ne) In r dr. (36) 
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We shall now find fi(r), the density of ®, and show that it is inde- 
pendent of the input density po (2). To begin with, let us say that X = z. 
Then the joint density of (Y,, Y2), given that X = zx is 


a = y) 


(37) 


=5 — exp {—[(y — V/2S cos x)” + (ys — 28 sin x)"|/2N}. 


The joint density of Y1, Y2 or the corresponding joint density of R, ® 
is obtained from (37) by averaging over 7: 


flr) = rpi(r cos 9, r sin ¢) 


= rf po(x)pu(r cos y,7r sing|X = x)dzx 


® 1 
= Le pola)dae 5 


“exp {- on [r cos ¢ — V/28 cos a)’ (38) 


+ (rsing — V/28 sin i} 


— 1 tte) sen [ 
= ay re x po( 2) 


“exp (x= cos (x — °)) dx 
N 
Now, the marginal density for R is obtained by integrating ¢ out of (38) 


filr) = [ falrse)ae 





ne (39) 
age aan rv/28 cos (x = v) 
= sy | def dx po(x) exp (aoe) ' 
Interchanging the order of integration, we get 
—(r2+4+28)/2N 
f(r) = Sg 2 polx)ae [ dy exp rv2s cos (gy — 2) 
(40) 





i (25) 
N sae ON a fad 
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independent of po(z).* We conclude from (40) and (36) that 


Cea Ae ee [ao ni) dr +In2n, (A) 


Po (zx) 


where f(r) is given by (40). 
Let us now say that the input distribution is po(w) = 1/27. Then 
from (38) 


1 
flr) = filr) 37? (42) 
vis 
so that ®, ® are independent with the marginal density of ®, fo(e) = 
1/27. Thus, in this case, the equalities in (84) and (85) and hence in 
(36) hold yielding 
H(¥1,¥2) = -f[ fr) mE dr +n (Qe), (48) 


so that (41) is satisfied with equality. From (28), (30), (41), and (438), 
the channel capacity C is given by 


ba = — [ ne) ee dr — IneN. (44) 


If we set p = 7/+/2S and A = S/N = S/N.W, the “signal-to-noise 
ratio’’, we obtain 


z = — [io inte) dp + n 24 ; (45a) 
where 

f(p) = 2Ape 4**Y I (2Ap). (45b) 
Iv. BOUNDS on R(8) 
4.1 Upper Bound on R(8) 


We need the following two lemmas: 


* We shall make frequent use of the formula 
1 Tv 
Ip(x) = / er cos 8 dé, 
2r JL » 


which can be found in Ref. 4, p. 79. 
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Lemma 1: Let g:,g2,°++* 5 Gp be real numbers. Then 
“ 2 
Lo = 
k=1 
Proof: From the Schwarz inequality 


63 1 -o) < (= r) bs a) =pdoe. (47) 


(Zu). (46) 


Slr 


A 


k=1 


Lemma 2: Let {x;};-1" be a set of m n-vectors from GQ, with minimum 
distance d between pairs of vectors. The distance ts given by (11). Let y be 
an arbitrary vector in Gn, and denote by d; the distance d(x; , y). Then 


(= «) iii (3 a 4G GES) c <0, (48) 


Proof: Let us define a mapping of @, into Hs, , Euclidean 2n-space, as 
follows. If x = (21,22, °°: ,%n) € @x , then the corresponding 2n-vector 
igx’ = (uw, 01, U2, 02, °°* y Uns Un) Where 


Un = COS 2, Ve = SIN a, ae aa i! ng 8 (49) 


° . , 
Then letting x1, x. ¢ @,, and letting x = (Wn, vn, Ue, RB, °°; 
y . 
Uin » Vin) ANd X_ = (tgs , Var, Wer Voz *** 4 Won, Von) be the corresponding 
members of He, , the distance between x; and x: is 


#(x1, x2) = 2 sin {eu — au) 


k=1 


ee (50) 
= pa { (un = Uz)” + (vie = von)’ }. 
To see this we need only observe that if the wp, voe,k = 1,2,---,n 


are considered as arc lengths on a unit circle with center at the origin, 
then (a, vn) and (dex, vo) are the Cartesian coordinates of x, and 
Xox , respectively, (see Fig. 6). The quantity 2 sin [ (ain — 22x)/2] is then 
the Euclidean distance between (tix , viz) and (Uex , ver). Hence, d (x1 x2) 
is the Euclidean distance between x; and x... This also provides a justi- 
fication for calling d(x,y) a metric. We are now in a position to prove 
the lemma. 

Without loss of generality we may take y = (0, 0,---,0) so that 
y = dq, 0, 1, 0, es , 1, 0). Let x; = (Wan, Vin, Wiz , Via, aie , Uin » Vin) 
then 


di = d(xi,y) = > {(1 — win)” + va}. 


k=1 
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(24, - Lok) 





Fig. 6 — Proof of Lemma 2. 


Since d(x;,x;) 2d 


(3) d< SS d’(x; ;x) = > > { (wiz — Use) + (ve — Vin) } 


i<7 i<j k=1 


-Elnbw- (Eu) 


t=1 


rae-(S)] 


t=1 i=1 


=) {m (3 1 — ua)*) 


—(m- Dua) +m Do vue 


4 


-(Ex)] 


=m es 2 {(1 — win)? + va} 


(51) 
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> (= (1 — ua) 


-E (Em): 
( 


smDai—~¥ (Ea — ua), 


=1 a 


cow 


From Lemma 1, (51) becomes 


=) d <m 2, d;' — (> > (1 — a): (52) 


k=1 i=1 


: z 2 
Now, since uz, + v;° = 1, we have 


dj 


D> {= we)? + 02} = DO 1 — Que + ua? + 027} 
i E (53) 


2 > (1 — wiz). 


Substituting (53) into (52) yields 


(Gyenze-k(Euy. oe 


The lemma follows on multiplying both sides of (54) by 4/n. 


Derwation of the Bound: 
If z ¢ G, let us define the “sphere” S(z,p) as 
S(z,p) = {xe Qn: d(x,z) < p}. (55) 


Since the distance d defined on @, is a metric, it follows that if a code 
{x,};." has minimum distance (as defined by d), then the spheres 
S(x;, d/2) are disjoint. 

Consider the maximum size n-dimensional code with minimum dis- 
tance d and M (n,d) code words {x;};-.”. Consider the spheres S (x; , yd) 
about each code word, where 


‘= z(t Le eB) (56a) 
B = d'/2n. (56b) 


Note that since y > 4(0 S$ 8 S 1),* these spheres are not necessarily 


* This follows immediately when we write y? = 1/1 + (1 — 6)"?, so that y in- 
creases from 1/4/2 to 1 as 8 increases from 0 to 1. 
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disjoint. To each point in the sphere at distance r from the center assign 
a density o(r) = yd’ — r*. Then the “mass” of each sphere is 


i / (Ya — av, (57) 
r<yd 


where the integration in (57) is performed with respect to the Euclidean 
measure, assigned to @, in the obvious way. 

In general, a vector y ¢ @, will belong to the spheres about m code 
words say Xi , X2, °°: ,Xm. We assign to y, a density equal to the sum 
of the densities contributed by each sphere, i.e., 


m 


oy = >, o(d;) = myd — Da, (58) 
i=1 


i=l 


where d; = d(y,x;). If y belongs to no sphere c, = 0. Thus, we have 


mass of @, = / aylV = M(nd)-p. (59) 


Y€Qn 


We will bound M (n,d) by finding an upper bound on the mass of @, . 
Letting s = sy = o,/n, (58) becomes 


2 2 72 
Dadi a =: 7 = 2my'B — s, (60) 
where 8 = d’/2n. Substituting (60) into (48) we get 
(Qmy'B — s)” — 4m(Qmy’B — s) + 4(m) (m—1)68 $0. 61) 
Rewriting (61) 
0 < s' S m{48 — 2mBQy'B — 47 + 2) — 48(1 — 7B)}. (62) 


With y chosen by (56), 27*8 — 4y’ + 2 = Oand1 — ye> 0, so that 
(62) can only be satisfied if 


s= 2 <8/(1 — 78) = K(p). (63) 
Hence, from (63) and (59) we have 
M(nd) = 2 Fl gj,dV S K(6)n (Volume of @,). (64) 
Kb "@n be 


Now from (57) 





b= / (yd? — 9 )dV > dV = V,(Wyd? — 1) (65) 
r<yd/2 


Tay 2Q2-1 
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where V,(r) is the volume of the sphere in @, S(z,r), which is inde- 
pendent of z (due to the symmetry of @,). Thus, (64) becomes 


— nK(B)(2r)” 


The asymptotic rate R(8) satisfies 


R(g) = limit ~ In M(n, ~/2Bn) 
= (66) 
= ime In a IES a 2x) a 


noon Vn(/2y?Bn — 1) Rv (8). 


Applying the result of Appendix C we have R(8) < Co(y7’8) establishing 
the upper bound. 


4.2 Lower Bound on R() 


Again let us consider a maximum size n-dimensional code with mini- 
mum distance d and M (n,d) code words. About each of the code words 
x:(2 = 1, 2,---,M) consider the spheres S,(x;,d). We claim that 

M 


the union of these spheres U 5S, (x;,d) covers the entire space @, . 
i=l 


This follows from the fact that if xo ¢ @, is in no S,(x;,d), then 
d(Xo,x:) 2 d,7 = 1, 2,---, M, so that x» may be added to the code 
destroying the maximality. If V,(d) is the volume of S,(x:;,d) (Gnde- 
pendent of x;), then 


M-V,(d) = volume of @, = (27)”. (67) 
Thus, our lower bound is 


(2)” 





> 
M(n,d) 2 V.td)’ (68) 
The asymptotic rate R(8) satisfies 
a ee | —— ere | (Qr)" Ag 
R(B) = limit — In M(n,-V 26n) = limit — In —-——— =R, (6). (69) 


Again applying the result of Appendix C, we have R(8) = Co(B) estab- 
lishing the lower bound. 
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APPENDIX A 


Asymptotic Estimates of the Channel Capacity 
The channel capacity C is given by (12) as 


= —| Ho) mt ap + m4 (70) 
0 p e€ 
where 


f(p) = 2Ape 4%? Ty (2Ap). (71) 


In this appendix we obtain estimates of C for large and small signal-to- 
noise ratio A. 


A.1 Large A: We show here that 


C4, 4rA 
ae 2In—— + a(A), (72) 


where ¢,(A) —~0as A — o. To prove this we will show that for large 
A nearly all the contribution to the integral in (70) is for p in the neigh- 
borhood of unity. Part (7) is an estimate of this contribution. Part 
(72) shows that the remaining contribution vanishes as A > . 

(<) We shall show that if 6 = A™}, 


T(A) & ae “Ho) wl® ap = >} (73) 


he 
A ? 


as A o@, 
Using the asymptotic formula for Ip (x) for large argument” 


ble) = Ge E +0 (4) |, (74) 


we obtain from (71) (for large A) 


seo) = 4/4 dese" eo(F)] 1 -ssest+e (75) 


Substituting into (73) yields (after a change of variable) 


ma) =|-1+0(4 1) |B + Be+ Ba (76) 


* Ref. 5, p. 86. 
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; 2 = 
B, = i (/3 e4"(1 + 2)' In (/4 
—§ Tv Tv 


; _ 
B, = | (/2 ce“? (1 +2)! In (1 +2)! de (76b) 
L-§ T 


6 17 
= ig = e481 + 2)'0 (3) de, (76¢) 


Noting that the range of integration is —6 S x S 6 we can write 


; = es 
K, | 4/2 eo ln (/2 e 4” da (77a) 
—§ Tv T 
ri) A ‘ 
Ke i 2 Se dt SK | (77b) 
Bz = Ko(4 ) [ Ve —Azt dx <= K;0 (1) (77c) 


where (1 — 5)? < Ki, K; $ (14+ 8)’, < (1+5)'In(1+5)' and 
6 = 1/ A’. From (77b) and (77c) we see immediately that B,, B; —- 
0 as A — ~ so that we need consider only B,. From (77a) (letting y = 
4/2Ax) and setting 6 = A, we have 


A es oe K, Maes iy en 
fl 2 —¢-9 —_ — 7 
B, = 4K,In =n oe € dy Lye an” dy. (78) 


Since both integrals in (78) and K, tend to unity as A — », we have 
B,— 4 ln (A/ze) as A > &. Applying these results to (76) yields 


lim 7(A) = lim (1 +0 (4) (3 in 4. Be: Bs) =i tee 
A->00 A we A 


where 


ae ae (76a) 


tas) 


B, 


B, 








which is (73). 
(77) Here we shall show that with 6 = A‘ as in (7) above, 


nA) af f(p) ee ae ey eee (79) 
is p 


1+6 


> 
VIIA 


p 


To do this we write 


(A) = [fe ) inte) dp + ile ) nt) g 
(80) 
ae i f(p) inf? ap =1+C4+ C3, 
146 p 
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where a(O < a < 4) is arbitrary. We will show that for arbitrary ¢« > 
0, we can choose A sufficiently large so that 7(A) < e. Let us consider 
each of the integrals C; , C2 and C3 of (80) in turn. 


< a we may write 
(81) 


C,:For0 s 
f(e) < 2Ae“*I)(2Aa), 


since Ig (a) is an increasing function of x. Making use of the asymptotic 


formula for I9(x) (74) we obtain from (81) 


fe) < y/4 co (1 40(4)) 0, as A> 
amr A 


Thus, with A sufficiently large 











since a < 4 
Or ple) Egy OK<pK<a 
ed Qa 
and 
|\CGi|s peur a =, (82) 
= 3a? 3 


C. : Again using the asymptotic formula for Jo(2) (74) we may write 
fora SpZ1- 6, 

(5) eV ee" (1+ 0G) 

(83) 


se = yf herr 
im 4 / A oo (1 +0(4 1) +0 as A — o., 


Thus, with A sufficiently large 
[xe inte) < Ze, 


from which 
IGalstef pds. (34) 
C3; : As above, we may write for p 2 1 + 6, 
He) _ fee (1 +0 (5)) (85) 
Tp A 


p 
Substituting (85) into the defining integral for C3; (80), and making 


change of variable y = (p — 1), we obtain 
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=f V/A a+ pre ea in 4/4 dy 


+f Aad mie mat uytay 
ac [ 3 (1 + y)'e 4" In e4” dy 

: A 4 Ay? 1 
“Lyuawtore(eoa) 


Since for y = 6, (1 + y)' < 2c” and [a+t+y)Ind+y)7?| s &, 
we have from (86) 


ICs| S i (/2 ger ’ in 4 +2+0 ()| dy 
+ / (/4 eA Ay’) dy. 
ri Tv 


Using the well known asymptotic formula for the cumulative error 
function, and the fact that 6 = A‘, it is readily shown that for large A, 


= A —(A—l)y? oy 1 —At 
I i/ se dy © Taare ; (88a) 


(86) 


(87) 


and 


‘ A avy? 2 a ae —at 
i (/4e6 (Ay) dy © Tee (88b) 


Equations (88a and b) tell us that with A sufficiently large |C3| < 
Ej 3. 
Taking the above results together yields 


ln(A)| S (Ci) +[Q|4+(Cs| se 


with A sufficiently large. 
(4it) The final step is to substitute (73) and (79) into (70) and ob- 
tain 


oo in® + n7@ = ym", as A—> 0, 


which is what is to be proved, (72). 
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A.2 Small A: We show here that 


C 
eet at AC 
iv A(1 + 0(A)), (89) 
as A — 0. 

Substituting (71) into (70) yields, after a bit of straightforward 
manipulation, 


7 a eee ode | pe“? I(2pA)dp 
0 
(90) 


_ ade | pe “”” Io(2pA) In Io(2pA )dp. 
0 


If we change the variable of integration to x = 2pA, we obtain from 
(90) 


C =A 


¥ - € a 3 ~22/4A 
iv A + cad xIo(x)e dx 
et ANS (91) 
_ a [ alo(x)e 4 In Iy(x) dx. 
Now the first integral of (90) is known” and is 
| aIo(a)e* 4 dx = 8A7(1 + A)e4, (92) 
0 
so that 
=e eee [ tl(a)e” 4 In In(x)dx = 2A — cl (93) 
Ww DA dg ° oy an 


We can estimate the integral D for small A, by noting that most of the 
contribution is for small 2. Making use of the asymptotic formula for 
Iy(x), for small x 
2 
Ifa) = 1+ rt + 0(z"), (94) 
we have 


D= [ E + oa") Jew dx. (95) 


* Ref. 6, p. 198, (4a). 
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Since 
o 3 
LC —22/4a4 5 2 
I ae dx = 2A’, 
and 
‘d ae” 4 dx = 64A°T(3) = 0(A), 
0 
we have 
D = 2A°(1 + 0(A)). (96) 
Trom (96) and (93) we get 
& = 24 — Ae (1 + 0(4)) = AI + 0(4)) 


which is what was to be proved (89). 
APPENDIX B 


The Function d (€) 
In this appendix, we show that for & satisfying 
0<é<1, (97) 
there exists a unique \(é) which satisfies 


TOE) ) 





eer O) a 
If we define the function £ (A) by 
_, _ AQ) . 
E(A) = 1 T(x)? OSA < ~, (99) 


it will suffice to show that 
(4) &(A) is strictly monotone decreasing, 
(7) £0) = 1, 
(it) lim €(A) = 0. 
A700 


If (¢), (2), and (227) are true, £(A) is a one-to-one mapping of the half 
line [0, © ) onto the interval (0,1). 
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(i) Making use of the fact that Io (4) = I,(A) we can write 
d&(d) _ —Jo(A) NO) + HQ) 


aX (Iola) )? ny 
Since* 
Ip(X) — 1 er? diy 
WT 40 
we have 
L(A) = Ip’) = =) cos ge °° * dy, 
T “0 

and 


OOS : cos’ ge °°" ° de. 
Tv Yo 
Thus (100) becomes 
dé(X) 
dx 
is w : ; T 2 (101) 
_ -+ | prove? dp | cos” per rr # dy +4( cos pe* o8# ie) 
= Tv 0 0 Tv 0 : 
[Zo(A) ? 
By the Schwarz inequality 


T 2 T T 
(/ cos ye cos ~ de) < ( cos” ve cos y ie)( | Ps cos ¢ i) 
0 0 0 








(the strict inequality holding). Hence aM ) < 0 and (2) follows. 
- is AOY = ae 10) 2: 
(iz) €(0) = 1 T(0) > 1 oo 1. 


(zit) We make use of the asymptotic formula for Io(x) and JI,(zx) 


for large xf 
ce 1 1 
In(x) = \/ 2x E — Sx +0 (5)| 


l(c) = ye [1- 2 +0(4)]. 


Substitution of (102) into (99) yields (cz) immediately. 


* Ref. 4, p. 76. 
T Ref. 4, p. 86. 


(102) 
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Let us remark here that since Jy)(x) and J,(x) are even functions of x, 
if \(E) = a 2 Ois the unique nonnegative solution to (98), then \ (£) = 
—a is the unique nonpositive solution to (98). 


APPENDIX C 


Completion of Asymptotic Estimates of R(@) 

We have defined V,,(r) as the volume of the sphere S,(z,r) = {x ¢€ 
@n: d(x,z) <r}. Due to the symmetry of @, , Vn(r) is independent of 
z. Thus, we shall take V, (r) as the volume of 


(ir) = ‘x Sita eae. oe: (2 a %) < rt. 
k=1 


In this appendix, we evaluate 


lim 1 (23)” 
at —_~—_“——- Aff 
now = In Van) AE,. (103) 


We shall find #, by solving an equivalent probability problem: Let 
X1,X2,-+-°- be a sequence of independent random variables uniformly 
distributed on the interval [—7,z]. Let 


n 2 

Y,= & (asin) 
k=1 2 

It is clear that 

V(r) 

(25) 





Pr[Y, <r] = (104) 


hence, 


—lim(1/n) In Pr[Y, < an] = £,. (105) 


We now make use of 


Chernoff’s Theorem:’ Let Z,, Z,, «++ be a sequence of independent identi- 
cally distributed random variables with moment generating function E[e”*') 
= M(t). Let 
Py = Pr| 3% an], 
k=1 
wherea S E(Z,). Then 


22 2 ia Py = Inm, 
n 
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where m = mine M(t). 
t<0 


If we set 


_ | 
Zk = [2 sin *| } 
where X; is the above random variable, then 
Yn = 2, Zi: 
k=l 


Thus, from (105) and Chernoff’s Theorem, #, = —|ln m. 
The moment generating function of Z;. is 


M(t) = Ele7**] = = i exp {(2 sin 4) ; dx 


1 Tv 


ee 


(106) 
pore cos x) dz = e''Io(2t). 
Hence, 


m = min e* ®'J(2t). (107) 
t<0 


To find the minimum, set the derivative of (107) equal to zero: 
0 = eT — a)Io(2t) + 2: (22)I, 
so that the ¢ which minimizes (107) satisfies 


a, i I,(2t) 
i Io(2t) © 


The solution to (108), for ¢ < 0, is 24 = —Xd(a/2), where d(£) is defined 
by (12). (See the remark at the conclusion of Appendix B.) Hence, 


from (107) 
n-eol-(-).@]o6Q)- ow 


~nm= —n o(a(5)) + (1-5) (9) 
- (5) 


where C'o(£) is defined by (16). 





(108) 


so that 


& 
I 


(110) 
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Applying (110) to (66) yields Ry (8) = Co(7’B), and applying (110) 
to (69) yields Ri (8) = Co(@). 


APPENDIX D 


Exponential Error Bounds 


It is known that for any time-discrete (amplitude continuous) 
memoryless channel the smallest attainable error probability P,* (n,R) 
nit 


for an n-dimensional code with e”“ code words may be written 


P3(n,R) = exp [—nE(R) + o(n)], (111) 


where E(R) > Owhen & < C (the channel capacity in nats per symbol). 
Although E(R) is not always known exactly it can be estimated by 
upper and lower bounds. The best known lower bound on E(R) is 
given in Gallager (Ref. 8, Theorem 10) and the best known upper bound 
on Z(R) by Shannon, Gallager, and Berlekamp.’ 

Let P (y| x) be the channel transition probability density. We assume 
that any n-sequence of input symbols is an allowable channel input 
—e., no “input constraint”. The bounds of Refs. 8 and 9 can then be 
stated as follows: 

For any p 2 0 and input probability density f(x), let us define 


E(p,f) = Eo(p,f(x)) — pRo(p,f(x)), (112) 


where 


Eo(p,f(2)) = —In [ay il ae fla)P(y| ayn | (112a) 

and 
Rolo,fla)) = 5 Hol of(e)). (112b) 
With p = 0 specified let f,(a) be that input density which maximizes 


E(p.f(x)). It is shown in Ref. 8 that with p fixed f,(v) is the unique 
density which satisfies 


[ay P(y|x)""*a?(y) 2 | asly)"*ay, all x, (113) 
y uy 
with equality if fp(~) + 0 (all x) where 


aie [ t@Pyy | 2)! ¥Pde, (113a) 
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It can be shown that with p = 0, fo (x) is that input density which 
achieves capacity C, and 2,(0,fo) = C. In most channels of interest 
Ro(p,f,) decreases from C' to 0 as p increases from 0 to ~. 
We define the rate R.parametrically in terms of p by 
7 R = R(p) = Ro(p,fo(x)). (114) 
Then for 0 < p < 1, which corresponds to R,(1,fi(c)) < R < C, the 
exponent is known exactly: 
E(R) = E(p,f,) = Eo(osf.) — ek (115) 


where E, Ko , and f, are defined by (112). For p = 1, which corresponds 
to R s R.(1,fi), the (“sphere-packing”’), upper bound on #(R) is 


E(R) S Eo(of,), (116) 
and the (“random-coding”’) lower bound is for 0 < R < R.(1,f,) 
E(R) = Ey(ifi) — BR. (117) 


This estimate of E(R) may be improved for low rates 2. It is shown 
in Ref. 9 that if Z* (2) is an upper bound on E (2) which is sharper than 
the sphere-packing bound (116) for low rates R (such a bound can al- 
ways be found), and if Z* (2) and the sphere-packing bound are plotted 
versus &, then their common tangent is also an upper bound on E(R). 

The lower bound may be sharpened for low rates R ag follows. For 
p = 1, and input density g(x), define 


Ez(p,g) = Eoz(p,g) — pRoz(p,9), (118) 
where 
Eoz(p,g) | 
= -pIn fotz)ae f o(e!aa'| [ Pty lay'PCol 2a] (18a) 
and 


Roz(p,g) _ 5, Bue(oa). (118b) 


Then for any fixed g(x) and with R again given parametrically in terms 
of p by 


R= Roz (p,g); (119) 
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the (“expurgated’’) lower bound is 
E(R) 2 E.(p,g). (120) 


We shall now apply these results to our channel using the time-dis- 
crete model defined before and after (26). Here the input is a number 
X ¢[—7,7x], and the output is a pair of real numbers (Y,, Y2). If X = 
x is the input, then the conditional transition probability density is the 
two-dimensional 


Sica 


= — exp {—[(y1 — ~/28 cos x)” + (yo — W/2S8 sin x)]/2N} 


or in polar coordinates 


P(rw|x) = rP(r cos g,r sin g| x) 


_ ft ~ sin —r2j2N _ Vv 28! mo 
=xye @ exp ( 7 —7— 608 (yp *)) ; 


It may be verified by substitution into (113), that the input density 
f(a) = 1/29 maximizes FE (p,f(x)) for all p = 0. Further a direct sub- 
stitution of (121) into (112a) yields after a straightforward computa- 


tion 
aA . —Ay2 2QAv ee 
Eo(p,fo) = —In 2Ae I ve E (; = .) | dv, (122) 


where A = S/N, the signal-to-noise ratio. The rate, R, can be gotten by 
differentiating (122) with respect to p. This yields 


(121) 








R(p) 


5 Eo(p,fo) 
m —Ay2 QA ih vA 
|- | ni 1 (e -) ‘in to(; a 
2A ae —Avty e(; aa) (74 
" (1 + p) Jo me fy 


i —Av2 2Av 
[ n( Py” dv. 


After some manipulation one can show that R(p) | ,.0 = CG, the channel 
capacity as given by (12). The estimate of the exponent E(k) of (115), 
(116), and (117) is plotted versus R in Fig. 7 for signal-to-noise ratio 





(123) 








a) el/ 
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Fig. 7 — Upper and lower bounds on the error exponent £ (R) vs. R for signal - 
to- Hai ratios of (a) A = 2, (b) A = 20. (Curve A is the exponent # (R) in the 
range where it is known exactly (115). Curve B is the “‘sphere-packing’’ upper 
bound on # (R) (116). Curve C is the ‘‘random coding”’ lower bound on EF (f) 
(117). Curve D is Z* (R), the low rate upper bound (133). Curve E is the common 
tangent to E* (R) and the sphere-packing bound. F () lies in the shaded region, 


A = 2,20. The sphere-packing upper bound and the random-coding 
lower bound diverge for small rates R. We shall improve this situation 
by computing the low-rate expurgated lower bound on E(R) (120). 
If we again choose the input density to be g(x) = 1/22, -r Sa Sr 
we have from (121) 
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tP(y|a!)¥dy =< | arate exp (-S/N. — 7? 
[ Pu |2)'Puyle'}ay = f dr 5—x; exp ( S/N. — r°/2N) 


fo exp (- ¥ (/% [cos (p — x) + cos (g — 21) dy 


cs ee — _ 2 
> dr 5 exp ( S/N — r/2N) 


[ exp (- VS Br cos (yg — «)) dy 


a — [ rexp Sag ek (~ Br) dr, 


‘ 
~~ 
, 





where 


* . / 
sae tse | 


cos x + cos 2’ 


B=~/1+ cos (« — 2’) and a= tan™ | 
This integral is tabulated [Ref. 6, p. 198, #5] so that we have 
[ Puy|2)'PGy| 2’) dy = exp (—(8/2N)U — cos (x — 2). (124) 
y 


Substituting (124) into (118a) yields 


(2 ™ da" 
—pln bok au 


eon dg 26 


Eor(p,g) 
-exp [—S/2Np — (S/2Np) cos (x — 2’)| 
—p nf & — exp (—S/2Np)Io (58. .) 


—p In [exp (—A/2p)Io(A/2p)], (p 2 1) 
where A = cll . The rate R is given parametrically in terms of p by 


(125) 


\ = A Li(A/2p) _ 
p = 95 TAP) BAM) aoe 


| (p 2 1). 


Let us note that as p > © R(p) — 0. The expurgated bound is given 
by (120), (125), and (126), It is easy to show that as p— ©, (R — 0) 
the lower bound LE, (p,g) — A/2. 

We shall now obtain a sharper upper bound for low wie E* (R) 
which will in fact have E*(0) = A/2, establishing that H(O) = A/2. 
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Let us denote by p,(J7), the smallest maximum (normalized) correla- 
tion obtainable for an n-dimensional polyphase code with M code words. 
Paralleling arguments of Shannon (Ref. 10, pp. 647-648) it is not hard 
to show that the error probability for a code with WZ = exp (nR) code 
words satisfies 


P, 2 4Pr [error in a code with two code words with energy ST 
and correlation p, (47/2) in white Gaussian noise with 
spectral density N,]. 


The right member of this inequality is known [Ref. 11, (88)], and is 
equal to 

ip(_ ,/SL(,_, (% 

2? ( No 1 Pn 9 ) (127) 


1 = P 
b(x) = m6 is eo du 


is the cumulative error function. 

We now bound F(R) by finding a bound on p, (7/2). Since for large 
M, a code with M/2 code words has about the same rate as one with M 
code words, it will suffice to bound p, (M7). With MW = exp (nk) and 
n large, we have from (17) (since B = (1 — p)) 


where 


Rs O(1 — Vp,()). (128) 
If we define R parametrically by 
R= R(c) =C(c), Oe <1, (129) 
we have from (129) 
c21—Vo.(M) (130) 
or 
[1 — p,(M)] S$ o(2 — oc). (131) 


Substituting (131) into (127) yields 


P. S 3@ (- / i o(2 — °)). (182) 


Making use of the well-known asymptotic formula for the cumulative 
error function 6(—2x) & (1/+/2r2) exp (—2’/2) (large x), we obtain 
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from (132) for large 7 (and therefore large n = WT’), the upper bound 
on the error exponent 


E(R) = — lim > InP. S 


No 


Se(2—0) SBR) (133) 


where A = S/N,.W = S/N. When R = 0,¢ = 1, s0 that H(0) = A/2. 
The expurgated bound and the bound of (133) are plotted in Fig. 7. 
The upper bound is, of course, sharpened by drawing the common tan- 
gent of E*(R) and the sphere-packing bound. 


APPENDIX E 


Asymptotic Estimates; of Co(E) 
In this appendix we obtain estimates of Co(~) as & > 0 and é > 1. 
E.1 Small &: We show here that 
Cole) = INE + aal€) (134) 


where ¢:(£) > Oas&— o. 

From proposition (zz) in Appendix C, we know that as & — 0, 
A(€) — o. Again making use of the asymptotic formula for J)(«) and 
I, (av) for large x (102), we obtain by substitution into (15), 


g= x1 +0 ()]. (135) 


+ 011) = xt k + e(€), (136) 


or 


where e(£) > 0 as > 0 and k is a constant. Substitution of (136) into 
(16), and another application of the asymptotic formula for J (x) yields 
(134). 


E.2 Large &: We show here that 
Co(é) = (1 — €){1 + O10 — £)} (137) 


as £— 1. As above our first task is to estimate \ (£) when £ is near unity 
or A(é) is near zero. We need the asymptotic formulas for Jo(z) and 
I,(a) for x near zero (Ref. 4, p. 77): 
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2 
I(x) = 1 +5 + 0(2*), 


: (138) 
=F 4 gs 
I(z) = oo i6 + 0(2°). 
Substituting (138) into (15) yields 
ei ge ER = _A 3 
f=1 Foo. 5 t O(N’). (139) 
Setting £ = 1 — £ we have, 

f= + 00°). (140) 


We show that 
= 26+ 0(&) = 201 — £) + 0((1 — £)). (141) 


Equation (141) follows on setting « = X — 2€ and observing [from 
(140)] that 


00°) : 
E #: 00)" | 


as \ > 0 or £->0 (€— 1). Substitution of (141) into (16) and another 
application of the asymptotic formula for J)(x), yields (137). 


3 = —k 


? 


UTe>] & 


APPENDIX F 


Comparison of Alodulation Schemes 


In this appendix, we shall describe an amplitude modulation scheme 
and compare its performance with that of the phase modulation scheme 
studied in this paper. 

Referring to (21a) we see that our phase modulated signal may be 
written (during the kth subinterval) 


s(t) = vx sin ao2tWt + az: cos a2cWt, (142) 
where from (21b) 
[tn + fag? PR = 28, k= 1,2,---,n. (143) 


Consider an amplitude modulation (AM) scheme in which the signals 
s:(t) are given by (142) but with (148) replaced by the ““mean square” 
constraint 
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SS {fran OP + fen? P} S 28n = 2WST. (144) 
k=1 
The resulting signals s;(t) are then amplitude modulated signals with 
carrier frequency a27W radians per second and average power 


1? 4 1< kT /n ; 
af sw@a= a> | s2(t)dt < 8. (145) 
T Jo T t= Yce=1yryn 

Thus, in this case the signals are constrained to have average power not 
exceeding S. It is clear that, as for the phase modulation, the signals of 
the ath and Sth users of the channel are orthogonal so that we may again 
take the bandwidth (i.e., difference in carrier frequencies of adjacent 
users) to be W eps. Further, it follows from the analysis in Section III 
that this channel is mathematically equivalent to the time-discrete 
channel Gaussian channel considered by Shannon.!°!? This channel 
accepts real numbers at a rate of 2W per second and adds to each num- 
ber an independent Gaussian variate with mean zero and variance NoW. 
Messages are encoded in blocks (vectors) of 2WT real numbers (which 
take T seconds to transmit), each 2WT-vector having the sum of the 
squares of the coordinates not exceeding 2WST. Shannon has found the 
capacity of this channel to be (in nats per second) 


S 
where A = S/NoW, the signal to noise ratio. Equation (146) is plotted 


in Fig. 2 so that it may be compared to the capacity of the polyphase 
system. Note that for small A, In (1 + A) & A so that from (14) the 





Fig. 8 — Lower bound on the exponents E,(R) (curve A) and #,() curve B). Cp 
and C, are the capacities of the polyphase and AM systems, respectively. The 
signal-to-noise ratio A=2. 
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capacities of this AM scheme and the polyphase scheme are nearly the 
same. 

Further, letting P.»*(7,R) and P.a*(T,R) be the smallest attainable 
error probability for a code with parameter T and rate R nats per sec. for 
the polyphase and AM systems, respectively, we can write 


Pep* = exp [—-TE,(R) + 0(T)] 
Pea* = exp [—TEA(R) + 0(T)]. 
The exponent #,(R) is estimated in Appendix D and may be written 
L,(k) = WE(R/W), 


where E(R) is defined by (111). The exponent F,(R) is estimated in 
Refs. 10, 8, 9, 13. The exponents are compared in Fig. 8 for A = 2 by 
plotting their known lower bounds. It is also possible to show that 
EO) = E,(0) = AW/2 for all A. 
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Synchronization Recovery Techniques 


for Binary Cyclic Codes* 


By S. Y. TONG 


(Manuscript received December 17, 1965) 


A class of binary block codes capable of simultaneous correction of addt- 
tive errors and synchronization errors ts presented. This class of codes con- 
ststs of coset codes of binary cyclic or shortened cyclic codes, and retains the 
implementation advantages of binary cyclic codes. In most cases, the re- 
dundancy required to provide additive-error correction is sufficient to give 
synchronization-error correction so that no additional redundant bits are re- 
quired. 

Synthesis procedures to construct such codes are also presented, along 
with an upper bound on the number of synchronization errors which can 
be corrected by codes in this class. 


I. INTRODUCTION 


In serial-type data transmission systems, alpha-numeric characters 
are ordinarily represented by groups of binary symbols. To get mean- 
ingful information transfer, it is necessary at the receiver to correctly 
partition the incoming bit sequence, i.e., to establish and maintain 
“Character Timing’’. It is well known that channel noise not only pro- 
duces additive errors but also can cause timing errors; consequently, 
methods to correct timing errors have been suggested by many authors. 
Usually these methods require special coding of the messages, as in 
comma-free codes,’”*"*” or the insertion of synchronization sequences 
between blocks of messages.” 

A similar timing problem exists in systems where error control is 
employed; the problem is transformed from character timing to word 
synchronization, or, equivalently, the ability to distinguish information 
bits from check bits. Codes that protect word synchronization as well 


* This is a part of the Ph.D. dissertation submitted by the author to the De- 
partment of Electrical Engineering, Princeton University, Princeton, N. 
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as correct additive errors have been investigated. Sellers’ has proposed 
a scheme where a burst-error-correcting code interlaced with additional 
check bits is used to give limited protection against synchronization 
loss, or provide burst-error correction when synchronization is main- 
tained. Stiffler® has derived a necessary and sufficient condition for the 
existence of coset codes such that the sequences produced by slipping 
the word framing by r bits will not be code words. Such coset codes are 
useful for systems where coding is used for error detection only. 

In order to utilize such a coset code for both additive-error correction 
and synchronization-error detection it is necessary that synchronization 
errors not result in decodable sequences. A condition sufficient for this 
has been obtained by Levy.” However, a more useful result would be a 
condition which would enable correction of both types of errors. 

In this paper, a technique for obtaining codes capable of correcting 
synchronization errors as well as additive errors is presented. J'urther- 
more, it is shown that in many cases correction of synchronization errors 
is possible even in the presence of additive noise. Generally, the scheme 
requires no additional check bits and the implementation is simple. 

In addition to these fundamental results, a set of coset codes, optimal 
with respect to synchronization error detecting ability is obtained; this 
represents an improvement of Levy’s results.” 


1.1 Definitions and Preliminaries 


To correct word-syne loss for an error-correcting code there are two 
conditions which must be met in order not to reduce the normal error- 
correcting capability of the code. The first condition is that an error 
pattern caused by syne loss must not be in any of the cosets which the 
code utilizes for correction of additive errors. This will ensure that the 
loss of syne will not be interpreted by the decoder as additive noise, and 
vice versa. The second condition is that the set of error patterns caused 
by misframing in one direction must be disjoint from the set of error 
patterns caused by misframing in the other direction. If the second con- 
dition is met, one can proceed to correct the word framing error itera- 
tively. For the detection of syne loss, the first condition is necessary and 
sufficient, while for the correction of sync loss, the second condition must 
be satisfied. If the code is used for detection only, the first condition re- 
duces to the requirement that the overlapping of any two code words 
should not be another code word, so that syne errors can always be de- 
tected as an erroneous word, although one would not be able to dis- 
tinguish sync loss from additive errors. 
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Let 
A = (a, @2, +++ , Qn) 
B = (1, b2,-°-- , bn) 
C = (4, C2, °° ,€n) 


D= (d; , do, cae » dn) 
be code words, not necessarily distinct, then: 


Definition 1: A synchronization bit loss (or bit loss) of r bits in word 
framing is said to occur if the receiver bit counter is r bits behind what 
it should be. That is to say, if the sequence a, --- ,@n, bi, -++ , bn is 
framed by the receiver as Qn—r41,°** »Qnbi, +++ 5 One where r < [n/2] 
and the message is taken in such a way that b, is the first bit of the se- 
quence to arrive at the receiver. 


Definition 2: A synchronization bit gain (or bit gain) of r bits in word 
framing is said to occur if the receiver bit counter counts r bits more 
than it should. Thus, a, --- ,@n, b1, --+ , 6, is framed by the receiver 
AS Ary1, *** , Ond1,-** , 6, where r < [n/2]. 


Definition 3: Tf (Qn-is1, +++, Qnbi,+°+ bri) and (isqidize, «++, 
Gnbibe , --+ , b;) are not code words for every pair, A, B and all 2, 


Oma, 
then the code is said to have comma-free freedom r. 


Definition 4: A correctable coset of a code is defined as a coset whose 
leader is one of the error patterns the decoder corrects. 


Definition 5: If the sequences @j4:4:42, °+* , QndDibe, +++ , b; and @y_j41, 

- ,@,b1, -+* , ba-« do not belong to any of the correctable cosets of 
the code for every pair A, B and all z, 0 < 2 Sr, then the code is said 
to have syne-detection capability r. 


Detinition 6: A code is said to have sync-recovery capability r if the 
code has sync-detection capability g 2 r and if the cosets containing 
@ist, °°, Anb1, +++ , b; are disjoint from the cosets containing ¢p_;41, 

- Cnt, °** , dn; for all 0 < 7,7 S r and for every set of A, B, C, 
D of the code. 


Definition 7: A code is said to have guaranteed noise tolerance of (k,r) 
bits, if the code is guaranteed to correct r bits of syne slippage with k 
or fewer additional bit errors in the received block. 
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Definition 8: A code is said to have expected noise tolerance of EH (r) 
bits, if with a probability of at least 3, the code can correct r bits of syne 
slippage with H(r) or fewer additional bit errors in the received block. 

In the subsequent discussion all the error-correcting codes are as- 
sumed to correct random errors. 


1.2 Summary of the Results 


Consider a binary cyclic code which corrects ¢ 2 2r + 1 errors, r > 0. 
If such a code is shortened by 2r + 1 bits or more, it is shown in Section 
II that the code can be made to have sync-recovery capability r with 
expected noise tolerance of #(8) bits when a slippage of 8 bits occurs 


28 28 
ma) =2* 5 -(,%,), o<psr 


without added redundancy. A similar technique is developed for codes 
which correct more than one error. It is shown that by adding 2r zeros 
to each code word and shortening the code by 2r + 1 bits or more that 
the code can be made to have sync-recovery capability of r bits. 

A scheme which is applicable to a single error-correcting code is also 
developed. It is shown that for any error-correcting code without an 
even-parity check it is possible to have sync-recovery capability of one 
bit if two information bits of the code are replaced by two zeros. In 
Section III, techniques are developed for binary cyclic codes which are 
not shortened. A necessary and sufficient condition is derived for the 
existence of a coset code having specified sync-correcting ability. It is 
also shown that a cyclic code which corrects ¢ errors and has minimum 
distance d, can be made to have sync-recovery capability 


r Sdm— 2t — 2 


without additional redundancy and an optimal set of codes that assures 
r = dm — 2t — 2 1s given. 

For cyclic codes with some special properties, it is shown that a syn- 
thesis procedure can be used to construct coset codes of the given code 
so that one bit out-of-syne can always be corrected. This procedure 
applies to almost all of the Bose-Chaudhuri” Hocquenghem” codes 
that corrects more than two errors. 

Bounds on the amount of slippage which can be corrected are derived. 
It is shown that sync-recovery capability for any binary cyclic code can- 
not exceed [(n — k — 1)/2]* bits and syne-detection capability cannot 


* [z] denotes the greatest integer less than or equal to zx. 
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exceed n — k — 1 bits, where n is the block length of the code and k 
is the number of information bits in the code. A brief discussion on the 
syne-detection capability of error-detecting codes is included in Section 
IV. 


1.3 Properties of Cyclic Codes 


A subspace V of I-tuples is called a cyclic code if for each vector 


v = (do,a%,°-+,Q14) in V, the vector v = (ap1,a0,-+* , @1-2) is 
also in V. 

By considering each /-tuple as an element of the algebra A, of poly- 
nomials modulo x’ = 1, one may associate each I-tuple (ao, +++ , a1) 
with a polynomial f(x) = a) + aw, -*: , a4’ ’ in the residue class 
modulo x’ — 1.* It can be shown that a subspace is a cyclic code if and 
only if it is an ideal in the algebra of polynomials modulo x’ — 1.” The 


generator g(x) of the ideal is known as the generator polynomial of the 
cyclic code. It follows that 1, which represents the natural length of the 
code, must be the least common multiple of the roots of the generator 
polynomial of the cyclic code. Given an (i,k) cyclic code, it is always 
possible to form an (1 — i,k — 7) code by making the 7 leading infor- 
mation bits identically zero and omitting them from all code vectors. 
Such a code is no longer cyclic, and is called a shortened cyclic code. 
Denote the code space of a cyclic code by Cy and the code space of a 
shortened cyclic code by C;, where 7 is the number of bits shortened. 
Let n be the block length of a shortened cyclic code (i.e.,n = 1 — 2), 
the higher order J — n bits are identically zero and hence are not trans- 
mitted. We can imagine that the receiver will decode the shortened code 
by first augmenting the received n-bit code word by | — n zeros and then 
decoding it as if it were a full-length cyclic code. (Note that the actual 
receiver need not perform these precise functions, but in any case it has 
to do something equivalent to this.) 

Now let us investigate what will happen if an r-bit loss occurs as 
shown in Fig. 1. Given that the transmitted message is R(x), the re- 
ceived message is 


x R(x) + A(x) + 2"B(z) (1) 


where A(x) is the portion of next word entered into the framing and 
B (a) is the higher order r-bit portion of R (2) out of framing. In general, 


* This paper discusses codes over binary field only so that the coefficients of 
polynomials are either 0 or 1 and + or — signs are used interchangeably. 
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aa poet 





Fig. 1— The situation of synchronization loss. 


A(x) and B(x) are not predictable and one may write the received 
message in the following form 


xv R(t) + 61° (v) + 262" (x) (2) 


where 6,’ (x) represents a polynomial of degree at most r — 1 with ran- 
dom coefficients. For cyclic codes, n = 1 and z' = 1. Therefore, 


bi (x) + vb" (x) = &" (x) + &:' (x) = F(x), (3) 
so that the received message is 
x R(x) + & (x). (4) 


Since 2’ (x) is an element of Co, it is divisible by g(x) so that its syn- 
drome is 0. The coset leader of the coset to which 6’ (x) belongs must have 
a weight of no more than r.* For the same reason, with a shortened cyclic 
code the coset leader of the coset of 8 (x) = [8,"(x) + x78," (x)] must 
have a weight of not more than 2r. It can be shown that a similar result 
holds for an r-bit gain. It follows that: 


Theorem 1: A slippage of r bits in synchronization can result in a vector 
at most distance 2r from a nearest code vector if the code is a shortened cyclic 
code and at most distance r from a nearest code vector if the code is cyclic, 
where code vectors are elements of Co. 


II. SCHEME FOR SHORTENED CYCLIC CODES 


2.1 Codes that Correct at Least Three Random Errors 


Suppose a fixed polynomial, P(x) is added to every code word at the 
encoder and the same polynomial is subtracted from the received mes- 
sage at the decoder. If the sync is maintained properly, the effect of 
P(x) will be canceled. However, if syne is not maintained, an error 


* We assume that the coset leader is the minimum weight element of the coset, 
which is the desired case for a random-error correcting code. 
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pattern may be detected. By choosing P(x) in a suitable way, it is con- 
ceivable that one may be able to detect or even correct syne slips. 

Let the code word be R(x) ¢ C; = Cy, where x’ = 1 and nis the 
length of shortened code word. The transmitted word is R(x) + P(a). 
At the receiver, assume the word framing has an r-bit loss as shown in 
Fig. 1. The received word, Q(x), according to (2), takes the following 
form: 


Q(x) = a'[R(x) + P(x)] + br) + 262" (x). (5) 
The receiver then subtracts P (a) from Q(x), i.e., 
Qi(z) = Q@) + P(e). (6) 


The syndrome of Q, (x) is the remainder of Q,(x)/g (2), i.e., 
{Q:(z)} = {Q(@) + P(@)} 
= {1 + 2°)P(x) + &'(@) + 26" (x)} 
where {x} represents either the residue class of x modulo g(x) or the 
polynomial of at least degree in that class; the context should make it 
clear which is meant. If P (2) is chosen so that the coset { (1 + 27)P(x)} 
coincides with {a? + 6,’(~) + 2"6."(a)} , where n S 7 < 1, and 6; (x) 
is a polynomial of degree at most r — 1, forj = 1, 2, then, 
{(1 + a")P(x) + 5: (@) + x62 (x)} 
= {at + O(a) + 2x" (x) + bi (we) + 25x" (x)} (8) 
= {a + 6'(x) + 28," (x)} . 
Note that 2° + 63’(7) + 26,"(x) has at most 2r + 1 nonzero terms. 
Thus, if the code Co corrects t 2 2r + 1 errors, the polynomial 
xi + $3 (x) + x"d4 (x) 
must be a coset leader of Co for all possible 6 (~)’s. Thus, ifn + r S 1, 
x? cannot be canceled by the 6’ (x)’s and the decoder will indicate that 
the 7 + 1 position of the received word is in error. But n S 7 < 1, so 
the 7th + 1 bit was not transmitted; this can be used to indicate that a 
misframing has occurred. 
Similarly, it can be shown that if the receiver has an r-bit gain, then 
the message to the decoder is 
Qo(x) = x "[R(x) + P(x)] + 2h’ (we) + 2" "5 (e) + P(x) (9) 
{Qo(x)} = {a + 28s (x) + 2" Sy (x)} 
= {x “Q(a)} for some Q, (x). 


(7) 


(10) 
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Since each possible Q;(2) is a coset leader for all 6’ (a), so is x "Qi(z). 
Thus, by reasoning similar to that employed in the bit loss case, an r-bit 
gain can be detected. 

In order to recover synchronization, one must be able to distinguish 
the syndrome due to bit loss from the syndrome due to bit gain. This 
implies: 


{ar + 85" (x) + wb" (x)} fa + 6s (a) +e” Sy (x)}. (11) 


That is, the error patterns must not be in the same coset. 
fa oa A bs (x) + 8" (x) + 85" (x) + a” “6S (e)} 40. (12) 


The weight of the polynomial within the bracket is at most 4r + 2. 
Since the code is ¢ = 2r + 1 error correcting, any code word must have 
a weight of at least 4r + 38. Hence, the inequality is always satisfied 
provided that either x* or z’” is not canceled by the terms of the 6; (zx) 
polynomials. Otherwise, it would be possible for all other terms of 6; (x) 
to be zero, resulting in a zero polynomial and thereby contradicting (12). 

It is clear that in order to do this it is necessary and sufficient to have 
either x’ or «’” or both not in those positions where the 6’s may take 
the value of 1’s; that is to say, either 


Pet = Lee Pee (13) 
or 
nt+r—-l<t—-r<l—r (14) 
must be satisfied. Both conditions require 1 — n 2 2r + 1 which is the 
minimum number of bits required to be eliminated. 

Thus, we have shown that if: (7) a cyclic code corrects t = 2r + 1 
errors; (77) the number of eliminated bits is at least 2r + 1; and (777) it 
is possible to find a polynomial P (a) constrained by (8); then,upon an 
r-bit loss or gain, the syndrome generated will be different from that of 
any of the correctable cosets and every bit loss syndrome will be different 
from any of the syndromes caused by bit gain, and vice versa. To com- 
plete the analysis, one must show the existence of polynomials P (x) 
constrained by (8). 

Recall that the requirement on P(x) is that 


{1 + #’)P(x)} = {a + O (x) + 20." (x)} . (15) 


Since the 6’s are arbitrary polynomials, [as they will be combined with 
5’s, see (8)] we may treat them as variables in determining the simplest 
possible P (a). In particular, if one elects to satisfy condition (13), i.e. 


n+r-—-1l<iti<l-—-r 
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and to minimize the number of bits to be eliminated, the smallest 7 
should be selected. Therefore, let 7 = n + r. For reasons which will be- 
come apparent later,* we set 6,"(~) = O and @,"(x) = 1. Then 


{1 + 27)P(@)} = {2"" + 2°} 
which implies 
{P(x)} = {x"}. (16) 


Thus, we have constructed a P (a) which satisfies (15). This completes 
the derivation of P(x) for the detection and correction of an r-bit gain 
or loss. To see if this pattern is also good for any 8-bit syne slippage 
(0 < 6 Sr), we note that the error pattern for a B-bit loss is 


(1 + 2°)P(a) + 6P (x) + 262’ (x) 
=a +a" + P(x) + 2" (x) (17) 
= a" 4 5 P(x) + 256 (2). 
The number of nonzero terms of the polynomial is at most 
28+1<2r+1s4, for all Bi <7; 


hence, identification of an error in position n -+ 8 + 1 is always possible. 
Sincen <n+ 6+ 1 <1, and position n + 6 + 1 was not transmitted, 
the fact that the decoder will show the n + 6 + 1 position to be in error 
can be used to indicate that an out-of-syne situation exists. By a similar 
argument, it can be shown that the detection of any 6-bit gain is also 
possible for 8 <r. 

In order to recover synchronization, the set of syndromes correspond- 
ing to bit loss must be different from those corresponding to bit gain; 
that is, 


baer eye) eds (2D) 
—Bs B —By B (18) 
x fa" + 26 (x) + &” “8s ()} 
for all 
O<a,BSr 
or 


fo" ** + a” + 8: (e) + 2s" (2) + 26% (2) + x” *8P(@)} #0. (19) 


* To maximize expected noise tolerance, it is desirable to keep the degree of 
6"(x) small. 
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The number of nonzero terms within the brackets (19) is at most 
2a +284+2<44+3<2t+1Sdn 


where d» is the minimum distance of the code. Equation (19) cannot 
represent a code word unless both x”** and x” are canceled by some terms 
of the 6’s in such a way that all an-zero polynomial results. It is seen from 
(19) that n + a <1 — B isa sufficient condition for x”*¢ not to be can- 
celed by any of the 6’s. Since 1 — n = 2r + 1>a-+ 8, this is always 
satisfied. Thus, we have shown that a shortened cyclic code which cor- 
rects ¢ random errors will have sync-recovery capability of r bits, if 
lL—n 2 2r+1,¢ 2 2r + 1, and if the syndrome of x” is added to each 
word after encoding and before decoding. 

The implementation of this scheme is obviously easy. The generation 
of {~"} can be accomplished by adding one bit corresponding to position 
x” at encoder and decoder. (Since this position is not actually transmitted 
over the channel, only the syndrome of x” is added to the encoded word. ) 
Usually only a slight loss in efficiency results from the shortening of the 
code. 

The decision rule can be formed as follows: 

(<) If the decoder indicates that the bit corresponding to x” is in error, 
but not any x", n < k < 1 — 1, then one assumes the system has gained 
a few bits in bit count. By extending the syne count one bit at a time, 
the system will regain sync in at most r word times. 

(ii) If the decoder indicates that the bit corresponding to «”™ is in 
error, 1 S 6 Sr, then one assumes the system has lost 8 bits. The word 
syne can be recovered either by using step-by-step correction as in (7), 
or by a one step correction. 

Note that the P(«) derived here is but one of many possibilities; for 
example, by letting {P(x)} = {a7} one would come up with a similar 
decision rule which favors bit-gain correction rather than the bit-loss 
correction as we have done. 


Example: Consider the (23,12) Golay code shortened to a (20,9) code. 
Since it is a triple error-correcting code and since 1 — n = 3, it should 
have sync-recovery capability of 1 bit. Let us demonstrate this fact by 
step-by-step computation. The generator polynomial for the code is: 


g@)=ltervtettae tot ate" 


| 


= 101011100011. 
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The syndrome of x” = x” is 00101101111 = {P(x)} and of 2"*' = 2” is 
10111000110. Assume the information 000000001 is to be transmitted. 
After encoding, the code word is 


1,01011011110000000001,01 
Add P(x) 00101101111000000000 


1,01110110001000000001,01. 


The above sequence is the transmitted message. 
(2) Assume one bit loss has occurred. Thus, the received message is 
1,01110110001000000001,01. message flow 
t t 
Receiver Frame 
To this word the receiver adds P (2) and three 0’s 


10111011000100000000 
00101101111000000000 


A(x) = 10010110111100000000 (000 ) 


at the high-order end, as shown in the parentheses, to make a cyclic code. 
The syndrome of the message, A (2), is the remainder of A (v)/g (x). 


{A (x)/g(x)} = 00111000110 
10111000110 + 100000000 
= {2°} + {a'}. 


The decoder will indicate that bits corresponding to x” and 2° are in 
error, but 2” was not transmitted and 27 = vt! = x”*?, By the deci- 
sion rule just derived, one decides that a one bit loss has occurred. 

(22) Suppose a one bit gain has occurred. Thus, the received message 


is 
1,01110110001000000001,01; 
T 


add P(a) and three missing zeros. We have 


11101100010000000010 
P(x) = 00101101111000000000 


11000001101000000010 (000). 
The syndrome is 


01110110001 = 00101101111 + 01011011110 = 2” + 2”. 
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By the decoding rule we see that x" = x” is in error but not 2° for all 
k such that 20 = n < k S1— 1 = 22, hence the decoder decides that 
a bit gain has occurred. Notice that in this case, only two errors are in- 
dicated by the decoder due to a misframing of one bit. Because the code 
is triple error-correcting, syne recovery of a one bit of misframe is pos- 
sible even if there is an error due to additive noise. This feature of noise 
tolerance will be discussed in the following paragraph. 


2.1.1 Noise Tolerance 


Assume a slippage of 8 S r bits has occurred. According to (17) the 
error pattern is 


eth 4 5 P(x) + wds*(x). 


The weight of this error polynomial is at most 28 + 1, but the code 
corrects ¢ errors, so that at least t — 28 — 1 additional errors can be in 
the received block without disturbing the sync-correcting process. This 
is so because every error-bit position can be identified correctly pro- 
vided the total number of errors does not exceed t; hence the guaranteed 
noise tolerance is (¢ — 28 — 1,8). 

Since channel noise cannot affect any of the bit positions n < 7 < | 
directly, the vital bits for detection and correction of syne are not likely 
to be corrupted by noise. In fact, it can be shown that at least 2r + 2 
additional errors are required at specific locations to change the bit in 
position 2 forn <7 Sl. 

If we assume that the probability of occurrence of any nonzcro term 
of the 6’s is 1/2, we can compute the expected noise tolerance H(8) as 
follows: 

If 8 bits of slippage occurs, there is at least one error caused by the 
error pattern purposely generated by P(x), but not more than 26 + 1 
errors, in accordance with (17). The probability of occurrence of a total 
of 7 errors due to syne slippage of 6 bits is 


P= (, yee (20) 


the number of additional errors which can be tolerated with 7 errors is 
t — 1, so the expectation is 


264-1 
B(6) = 2° Y(t 4) (, on (21) 


That is to say, on the average, for a B-bit sync slippage, one can tolerate 
E(@) additional errors. For example, consider a (255,215) BCH code 
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which corrects five random errors. Since t 2 27 + 1, max = 2,1 —-n= 
2r + 1 = 5,1 = 255, n = 250; therefore, the (250,210) shortened cyclic 
code can have sync-recovery capability up to 2 bits per word. 

The guaranteed noise tolerances are (¢ — 26 — 1,8) = (2,1) and 
(0,2), and the expected noise tolerances are 


E(1) -7y6-a(,?,)=3 
E(2) = 2° 25 jo )=3 


2.2 Double Error-Correcting Codes 


The scheme just proposed cannot be used for double error-correcting 
codes because, in the worst case, for only one bit of syne slippage, it is 
possible to have three errors generated as a result of syne loss. (However, 
the probability of recovering sync loss is still high.) In this section the 
scheme is modified so that it is guaranteed to correct sync loss for double 
error-correcting codes; however, extra redundancy is required. 

Let m zeros be placed on each end of a coded message of an n-bit 
shortened cyclic code. The transmitted word is then a stream of binary 
messages interlaced with 2m-zero bits between messages (i.e., the added 
zeros are actually transmitted ). A typical word is of the form shown be- 
low 

xy! = 4a: prt am eae gat gen ae porn 


all zero eh (s) all zero 


where the first and the last m bits are identically zero and the center 
portion is the shortened code word, R(x). 
Let 2” P (a) be added to such a code; the transmitted message is 


v"[R (x) + P(@)). (22) 


It is clear that the word framing at the receiver must contain 2m + n 
bits. It follows that all terms of 2”*"[R(x) + P(x)] will be within a 
single receiver frame for all | r | S m. Thus, for an r-bit loss the received 
message, in the absence of noise, is 


a "TR (w) + P(x)]. 


After subtracting x”P(a#) at receiver, the syndrome of the resultant 
message is 
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{2""[R (a) + P(x)] + x"P(x)} 


(23) 
= {2"(1 + w’)P(z)}, lr| Sm. 
Let P(x) = {2'""", then the syndrome due to an r-bit loss is 
{e"(1 + 2")P(x)} = fat + a}. (24) 


If 2m + n S 1 — 1, the bit corresponding to x’ (which is the /th bit) 
is not transmitted. Hence, the detection of x’ in error can be used to 
indicate a bit loss. In order to correct the bit loss, note that the error 
at «'*”* corresponds to the bit at position 2 -+ r modulo I. Therefore, if 
r > 0, the error bit position corresponds to one of the first m bits which 
is known to be zero so that the inconsistency between the calculated 
error bit and the actual bit value at position 7 can serve as an indication 
of an r-bit loss. 

Similarly, for r < 0 the bit corresponding to the syndrome {x is 
ltre=WWmt+ntrt+1eim+n+4+1 forO>r = —m but the 
bits m ++ 1,m-+ n+ 2,---,2m + n are known to be zero, so 
that the r-bit gain can be detected in the same way. Notice that if 
L+rzZ2m-+ n, for somer < 0, the error indication directly shows an 
| r |-bit gain has occurred since the bit J + r is not transmitted. 

Recall that one requires 


2m +n 


sa 


IIA 


1-1, (25) 
1.€., 
l—nS2m+4+1 (26) 


so that at least 2m + 1 bits of the information symbols must be elimi- 
nated. 

Since exactly two errors (namely 2’ and x'*"") will be generated 
for every sync loss up to m bits, a code capable of correcting at least 
two errors must be used. It is obvious that for a t-error-correcting code, 
an addition of ¢ — 2 errors anywhere in the data section (i.e., in the 
x” R(x) part) can be tolerated. However, errors in the zero section can 
affect the syne-correcting process, although the syne-detection capability 
will not suffer. Thus, the guaranteed noise tolerance for sync-recovery is 
zero while it is t — 2 for syne detection. 

We summarize the sync-correcting rule as follows: 

(<) If x’ is in error then one assumes that a sync-slip has occurred. 

(it) If, in addition, one and only one of the calculated error bits is in 
the bit position corresponding to 2''”"*, |r| S m, while the actual bit 


1 


of that position is either not transmitted or has the value of zero then 
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one assumes an r-bit loss has occurred (if r < 0, | 7 |-bit loss = r-bit 
gain). 

It is easy to see from the above argument that any P(r) = {a'“}, 
12181 — 2m — n, can be used for this scheme. 


Example: Consider a (15,7) BCH code shortened to (12,4) code with 
one zero adjoined at each end of the code to obtain (14,4) code. Here 
1 = 15,m = 1,n = 12. The code is generated by 


gv) =1+teaetteaota’ ta’ 


Assume the message 1100 is to be sent. After encoding we have the 
(12,4) code word 


R(x) = 010001011100 
P(x) = {a ""}) = {2} = 001011100000 
R(x) + P(x) = 011010111100. 
Add a zero at each end, the transmitted message is 


0,00110101111000,0. 


Notice that the neighboring bits of the message must be zero since each 
message must begin and end with zero, by construction. 

(7) At the receiver assume a one-bit loss has occurred. Then we have 
the message, as the receiver sees it: 


0,00110101111000,0. 
T 
The receiver adds xP (x) and the resulting message becomes 
00011010111100 
add xP (x) = 00010111000000 
00001101111100(0). 


The syndrome is 10010111 = 00010111 + 10000000 = {x} + {2°} 
which indicates 2“ and 2° are in error but 2” is never transmitted and 
x is known to have been transmitted as zero. Therefore, according to 
the decoding rule just derived, we see that word sync has slipped. Since 
ght = gt" — x r = 1, which shows a gain of one bit has occurred. 

(21) Assume one-bit gain has occurred, the received message becomes 


as shown 
01101011110000 
add P(x) = 00010111000000 
01111100110000(0). 
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The syndrome is 00111001 = 00010111 + 00101110 = {2} + {2}. 
14 - . . . ope 13 
x in error indicates a syne-slip condition and x* = 0 
1+r—1 13 15—1—1 
a =f =a : 


so r = —1 which shows that a gain of one bit in word sync has occurred. 


2.3 Codes Without an Even-Parity Check 


Recall that, from (23), the syndrome is {x”(1 + w2')P(x)} for an 
r-bit loss. It is desirable to have such a syndrome coincide with the 
syndrome of a single bit, say «‘ *, and if the bit corresponding to 2‘ is 
either not transmitted, or if the value of the coefficient of x’ is known to 
the receiver by prearrangement, then it is possible to have sync recovery 
capability for a single error-correcting code. We shall show in the follow- 
ing that such a possibility does exist. 


Assertion: For a cyclic error-correcting code without an even-parity 
checkt it is always possible to modify the code in such a way as to have 
syne-recovery capability of one bit. The scheme is based on the following 
theorem. 


Theorem 2: If g(a) ts a generator polynomial for a cyclic code of natural 
length | and if 


GCD (g(z),1l +2) =1 

then 

(i+ 2)[{2}. 
Proof: aQ+2) fg) =g9(01) #0 

.gA)=1 or 1+g1) =0 
or 1 + g(x) is divisible by 1 + z. 
Likewise x { g(x) because g(x)h(x) = ve + 1 and if x«|g(x) then 
a |a' + 1 which is impossible. 
Therefore, 
«| [l +g @)). (27) 


It follows that 1 + g(x) = x(1 + x)F3(x). In the ring of polynomials 
modulo #’ + 1 


a =1 
= ie., the remainder of x#/g(zx). ee 
+ Or, equivalently, the generator polynomial of the code is not divisible by 
oP 


SYNCHRONIZATION RECOVERY TECHNIQUES Die 


or 
ao = a 
by (27), 
vt’ =a (1 +g(x)) = (1+ 2%)F3(x) [mod g(x), 
1.€., 


fe} = (1 + 2)F3(z). 
Q.E.D. 


Theorem 2 shows the existence of P(x) = {x '/1 + 2} for such codes. 
Thus, for any cyclic code which corrects one or more errors and whose 
generator polynomial, g(x), is not divisible by 1 + x, one may shorten 
the code by 2 bits, append one zero at each end of the encoded message 
and add the pattern aP (x) = x{2' 1/1 + 2}. The configuration is shown 
in Fig. 2. Note that the added zeros are actually transmitted. 

When the system is in synchronization, the framing of the receiver is 
as shown in Fig. 2. The receiver first adds «P (2) and then decodes the 
whole word. 

From (23), one-bit loss gives the syndrome 


{a(l + x)P(x)} = fa" “a} = {2} 


and one-bit gain gives the syndrome {a ‘(1 + 2x)¢P(a)} = {a}. 
Note that both a and x” are inserted zero bits; hence, the decoder 


can use the decision rule as shown in Table I. 


Example: Consider the single error-correcting Hamming code gener- 
ated by x + x + 1. For this code] = 15. From Table II: 


Fig. 2— The relation between message and word framing. 
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TABLE I 
Error Locator Indicates Real Bit Value Decision 
az in error 2 =0 one-bit loss 
x in error xo =] bit 2° in error 
x! in error zl1=090 one-bit gain 
z'-! in error abrl =] bit x! in error 








The code is modified from (15,11) to (15,9). Let the information bits 
be 001000101. After encoding it becomes 000100010001 010. 
Adding xP (x) = x", we have 


«P (x) = 000000000001000 
000100010001010 


000100010000010. 


Case 1: One-bit loss: 
The received message becomes 


00001000100000100 
xP (x) = 00000000000100000 
00001000100100100. 
The syndrome is 1000; it indicates the first bit in error since the first 
bit is 0. From Table I we read ‘‘one-bit loss has occurred.” 
Case 2: One-bit gain: 
The received message becomes 
0010001000001000 
xP (x) = 0000000000010000 
0010001000011000. 


The syndrome is 1001; it indicates « in error but x” = 0. So again by 
Table I we have detected one-bit gain. 


TaBLe II—GF(2') 











2? 1000 x8 1010 
zw} 0100 x3 0101 
x 0010 gio 1110 
x3 0001 gu 0111 
x4 1100 gl? 1111 
x 0110 gs 1011 
x6 0011 gis 1001 
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2.3.1 Notse immunity 


It is obvious that this scheme uses only one bit to detect and correct 
syne loss so that for codes which correct ¢ errors we can afford t — 1 
more additive errors in the code word provided that the bits z° and 2’ 
are not in error. 


III, FULL-LENGTH CYCLIC CODES 


For cyclic codes, the technique used to distinguish synchronization 
loss from additive errors for shortened codes no longer applies. One must 
select P(x) in such a way that over the range of synchronization slip- 
page r, the error patterns so generated do not fall into any of the correct- 
able cosets. This means 


d(x) + P(w) + 2°) ¢€e (28) 
and 
x” “6 (x) + P(x) (1 + 2”) €@, (29) 


for all 5° (a), 0 < s S r, where @ is the set of all polynomials that belong 
to the union of all correctable cosets. By the following lemma it will be 
shown that conditions (28) and (29) are equivalent. 


Lemma 1: Q(x) €@ 
af and only af 

x’ Q(x) €e for all 1. 
Proof: («) Suppose Q(x) ¢ @. Denote the coset leader to which Q(x) 
belongs by Q.(#); then Q(x) + Q.(%) = we(x) is a code word, but 
t'Wa(x) is also a code word and the weight of Q.(x) is the same as the 
weight of 2°Q.(x); the weight of Q,(x) is no more than é, the maximum 
number of errors the code corrects. Therefore, x°Q,(x) must also be a 
coset leader. It follows that 2'Q.(x) + 2*we(x) = x'Q (x) must belong 
to the coset whose leader is x*Q,(x). That is to say 2°Q(x) € @. 


(i) #' Q(t) €@ S x” *('Q(xz)) = Q(z) ee by (i). QED. 
Since (29) can be rewritten as: 
xc" "[5'(x) + P(x) (1 + 2’)) €@, (30) 


by the above lemma and the law of contraposition, (28) and (29) are 
equivalent. The condition (28) can be restated as: 


WQs(x)] =t+1 | (31) 
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where Q,(x) is the coset leader of the polynomial 
i (az) + P(@w)(1 + 2°) 


and W([z] is defined as the weight of the polynomial zx. 

Equation (81) is a necessary and sufficient condition for a code to 
have sync-detection capability 7. It is sufficient because if (31) is true, 
then no error pattern generated by P(x) due to slippage of s bits, 


0: 6:7; 


can be in a correctable coset; thus syne loss can be detected. It is neces- 
sary because otherwise there must exist at least one particular 6°, say 
5, such that (31) is not satisfied, then say 


WIQ.,(%)] St, for some a 
where 
Q.,(%) = 64 (4) + P(w)(1+2°), forsomes, O<s Kr. 
Then 
58(e) + P(x) (1 + 2") 


must be in one of the correctable cosets. 

To have sync-correction capability r, the code must have sync-detec- 
tion capability at least r and the syndromes of the error patterns for bit 
loss must be different than those for bit gain; that is 


{5°(@@) + Pw) + x*)} x fa" ere) + P(e) + ai}, 


(32) 
O<p,qsr 
or 
(s?*4(x) + P(x) + 2?*)} £0 (33) 
or 
{aP"*(a)}  {P(x) (1 + a?**)}. (34) 


If p + q = n — k then the degree of 5’**(x) can be as large as 
ptq-lean—k-1. 


The right side of (84) has a degree at most n — k — 1. Thus, by proper 
choice of the coefficients of 8?*" (a), one can always equate the two sides 
of (384). Hence, p + q S$ n — k — 1 is a necessary condition to satisfy 
(34). But max p = max gq = r so that 2r < n — k — 1 is a necessary 
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condition for the code to have sync-correction capability r. Hence the 
theorem: 


Theorem 3: The sync-correction capability of a (n,k) coset code derived from 
a cyclic code cannot exceed (n — k — 1)/2. 

Ifpt+qsn—k —1, {8?"(x)} = 6°*(x) so that if the degree of 
{P(x)(1 + 2?**)} is at least p + q, then (34) will be satisfied; i.e., 


D{P(e)1 + 2?*)} => pt (35) 


is sufficient for (84) where D[G(x)] denotes the degree of polynomial 
G(z). 

Therefore, the necessary and sufficient conditions for a code to have 
syne-correction capability + are: 


Wie (x) + P(w)1 4+ 2°)) 2tit+1 forallO <s<r _ (86) 
and 

DiP(«)A+2?")} 2p+q  forallO <p, qSr_ (87) 
where Q,(x) is the coset leader of the polynomial 6°(7) + P(«)(1 + 2’). 


3.1 Scheme A — General Approach 


A decoder which corrects up to ¢ errors while utilizing a code with 
minimum distance dm can also detect up tod = dm — (t + 1) errors. 
Thus, if every error pattern generated by slippage s S r has weight 
between ¢ + 1 and d, it will certainly be detected. It follows that these 
error patterns cannot be in @. Based on this sufficient condition one can 
design P(x) accordingly. Suppose 


d2 wei ()+P@)dat+er’)2zt+1 (38 ) 
for all random polynomials 6° (2) for0 < s S r. Let 
n—1 
P(a) = 2) Px (39) 
i=0 
and 
n—s—l 
T(x) = >. (Pit Piss)a*™, O0O<sSr. (40) 
i=0 
Then 
d—s2z W(T.(x)) 2t+1, 0<sSyr, (41) 


is necessary and sufficient for (38) to hold. In particular, it is necessary 
that W[T,(«)] 2 t+ 1. It follows that the best choice for P(a), in 
the sense of maximizing 1, is for 
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WiT.(x)] =t+1, forallO<s<r. (42) 


Those P(x) that satisfy (42) will be called optimal. In general, for cer- 
tain special codes it is possible to find other schemes which work even 
ifd —r 2t+1 is not satisfied. This possibility will be discussed later. 

It can be shown by direct substitution® in (41) that the following 
P(x) are optimal. 


[t/2] 
P(x) = dS agro gt (43) 
o=09 
if 
AVG EA cso Seed 
where 


= 1+ 2[t/2] —¢ (44) 
and [t/2] represents the integer part of ¢/2. 


TABLE III 
Number of Levy’s Result Optimal Result 
Code dm Errors Corrected (Max. Number of (Max. Number of 
Sync Loss Detected) | Sync [ose Detected) 
(23,12) 7 1 2 3 
(127,85) 13 1 8 9 
2 5 7 
3 2 5 
(255,191) 17 1 12 13 
2 9 11 
3 6 9 
4 3 7 
(255,163) 25 1 20 21 
2 17 19 
3 14 17 
4 11 15 
5 8 13 


Levy’ has found a set of P (x) for the purpose of detecting synchroniza- 
tion loss. Table III compares Levy’s resultst with some of the optimal 
results just derived. It is interesting to note that appreciable improve- 
ments are obtained by optimal P(z). 

It remains to show that the polynomials P(x) of (43) possess sync- 
recovery capability. The following theorem characterizes the extent of 
slippage the code can correct provided that d — r 2 ¢+ 1. 

*See Appendix. 


t See Ref. 9, page 11, Table 1. Note 6 2 é@+ 1 is necessary to detect sync loss. 
Table ITI is constructed by letting 6= t+ 1. 
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Theorem 4: To every t error-correcting cyclic code there exists a coset code 
with sync-recovery capability of at least 
n—k—t- [t/2] 


ela 


r< 


provided that: 
(1) the coset code is generated by 


[t/2] 
P(x) = » pe th =a0 8 a 


o=09 


(7) d-r2ztt+il 
(iii) r + [t/2]}7@ +1) -ao<n-1 


where 
oo = 1 + 2Q[t/2] — t. 


Proof: Since P (x) has sync-detection capability r when d — r = ¢ + 1, 
all one has to check is that the syndrome of error patterns due to bit 
loss are different from those due to bit gain. 

From (37) one requires 


Wa) 2 \ 
D{P(x)(1 + 2?)} =D (e3 gi 70 4 ge ) (1+ on) 


o=09 


= Dil al a ee eg ete (45) 


+2°"}Spt+q 18pt+qsr. 
Let the generator polynomial be denoted 


n—k 
g(x) = 2d gin’ ; 


then 
n—-k 


a g(x) = gov? + d gin’. 


In general, one may assume go = gn—-x = 1, and because x” = 1, we have 


n—k 


got g(x) = a ae > giv = ye a Q(x) 


or 


x" = 2" 'g(x) + Q(z). 
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Hence 

{z""} = Q(@), 
but as Q (a) has a degree exactly n — k — 1; it follows that 

Diz") =n—k—-1. 

Therefore, (45) has a degree n — k — 1 if the next highest order term, 
glt2l+) +P+4-20 is always of a degree less than n — k — 1, for alll S p, 
qs 1; 1e., 
(/2¢+1)+ptqa-o<n-—-k-1, 1 
But, max (p + q) = 2r, i.e, 


[t/2)\7 +1) +2r-—-ap<n—-—k—1 


IIA 


P, q = 7s (46) 


or 

n—k—1-+ 00 — [t/2] 
2 + [t/2] 

Recall that o9 = 1 + 2[t/2] — t, and therefore 


n—-k—t- [t/2] 
2 + [t/2] 


r< 


aed 


is sufficient to satisfy (45). 


Example: Find P (x) for (15,6) BCH code where dm = 6. 
(z) If the code is used for single error correction, t = 1 and 


d= = 64-64 LD j= rete), 
Therefore, r S$ 2, but 


ieee erat a ok Ze 
24 ([t/2) 0 
so that by Theorem 4, P(x) = 1 + 2” is a valid pattern for syne cor- 
rection up to 2 bits. 

(ii) If the code is used for double error correction ¢ = 2 and d = 3. 
Thus,d —r=3—r2t+1 = 3. Therefore, r = 0, so that it is im- 
possible to use this method but there are still other possibilities which 
will be discussed in the next section. 


9—1)/2=4 


3.2 Scheme B — Special Case 


The scheme just derived is applicable to all binary cyclic codes pro- 
vided that d, > 2¢ + 2, where ¢ is the number of errors the decoder ac- 
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tually corrects, and that the scheme is not applicable to those systems 
in which the decoder is designed to correct t = [(dm — 1)/2] errors. 

In this section, a different technique is developed. Instead of requiring 
that dm > 2t + 2, this technique requires that a set of conditions on the 
code structure be satisfied. These conditions are almost always satisfied 
by Bose-Chaudhuri-Hocquenghem codes that correct more than two 
errors. For such codes it will be shown that it is always possible to ob- 
tain a coset code that has syne-recovery capability of at least one bit, 
even if the decoder is designed to correct errors up to the guaranteed 
error-correcting capability of the code, i.e., £ = [(dm — 1) /2]. 

Definition 9: The weight of a coset is defined as the weight of its coset 
leader. 


Definition 10: Code C2 is said to be a descendant of code C, if 
aeCy>aeCe, Cy 4 C2 
and is denoted by the notation 
C1c Ce. 


Theorem 5: Cy C C2 tf and only tf go(x) | gi (x) 
where 
gi (a) and go(x) are the generators of codes C; and C2 , respectively. 


Proof: (i) Since gz (x) itself is a code word of C2 , if go (x) does not divide 
gi(v) then gi(x) ¢C2=> Cy € C2 ; a contradiction. 
(iz) If ge(a) | gi (x), then let 
r(v) = gi (x)/g2 (x) 
w(x) eC, ewe) = n@)f(e) 
= go(x)r (x) f(z) 
= wz) eCo. 
Hence, C; C C2, by definition. 


Theorem 6: Suppose code C, corrects t, errors and code C2 corrects tz errors. 
If Cy C Co and tt > te, then the coset 2 of C, that the code word K(x) « 
(C2 — C1) belongs, must have a weight of at least 2t, + 1. 


Proof: By definition (C2 — C,) is nonempty, so there exists K(x) ¢« C2 
but K (x) ¢C,. All the elements, of the coset 2 of C; to which K (x) be- 
longs, must be of the form 


K(x) + w(x), w(x) eC, 
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but, C, C C2, so 
o w(x) €Ce. 
It follows that 
K(c) + w(a) € Ce; 


thus, the weight of K(x) + w(x) must be at least 2% + 1, as Co is a 
ts-error-correcting code. It follows that Q must have weight at least 
2te + 1. 


Theorem 7: Let Q be a coset of a code C, K (x) an element ¢ Q, and suppose 
the weight of Q is R. Then the coset Q, to which K(x) + 2° belongs must 
have weight not less than R — 1. 


Proof: 
Q, = {(K(x) + 2°) + wp(x): w(x) € C} 
= {(K(v) + w(w)) + 2°: wx (a) € C} 
= {Q(x) + 2°: Q(x) € Qh. 


Since Q has weight R, each element of Q, differs from an element in 
Q by exactly one term. It follows that the weight of Q) is at least R — 1. 

The following several lemmas, in associated with Theorems 6 and 7, 
are essential to Theorem 8 which is the basis for Scheme B. 


Lemma 2: {K(x)} ¥ {a""K (x)} [mod g:(z)] 
of 
K(#) ¢€ Cy — Ci, C, CC, 

and 

gi(x)/go(x) = r(x) f (1+ 2) 
where g(x) and, g2(x) are generator polynomials of Ci, C2, respectively. 
Proof: By hypothesis, 

os +a) fr@. , (47) 
Assume 
JP STK) = {2""K(@)}— tmod gi(x)] 

i.e., 


{(l + 2)K(x)} =0 — [mod g:(zx)] (48) 
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or 

(1 + x)K(x) = f(t)m(@). 
Now, 
(1+ 2) |f@) > K(@) = fie)m(e) 


= K(x) « C; ; a contradiction. 


It follows that (1 + x) 7 f(z). (49) 
Since K(x) €Co, 
1.€., 


K(x) = fro(v)g2(x), 
by the assumption of (48) 
(+ a)K(x) = (1 + 2)fr(@)go(x) = fe)u (2) 
= f()g2(x)r (av) 
or 
(lL + a)fe(m) = f(v)r(a). (50) 


In view of (47) and (49), and by the unique factorization theorem, 
(50) cannot be satisfied. The lemma follows by contradiction. 


Lemma 8: {K(x) + 1} # {2""K(z) + 24 [mod g:(2x)] 
of 
K (a) € Ce = Cy and C7; Cc Cy . 

Proof: Assume the contrary. 
Then 

K(x) +1 + 2"7K(xz) + 2°" = f(x) (x) 61) 

= f(©)ge (x )r (x) 
but 
K (x) € C2 => ge(a) | K(x). 
The right-hand side is divisible by go(«) but not the left-hand side unless 
g.(e)| + 2"). 

The code generated by g2(x) has natural length n so that go(x) | (1 + 2”) 


but not any 1 + 2°, k <n. Therefore, (51) cannot hold and the lemma 
follows. 
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Lemma 4: {K(x)} 4 {x""*K (x) + 2” } [mod gi(2)] 
uf 

K(x) ¢ C2 — Ci, in casks OF 


Proof: Assume the contrary. 
Then 


aK (a) + Kz) +27 = f(e)m(a)r(@). 


The left-hand side is not divisible by go (x), since g(x) | K (x), so the 
lemma follows. 


Lemma 8: {K(e) + 1} ¥ {a" °K (#)} [mod g.(x)] 
of 

K(x) eC. — Ci, Ci C C2. 
Proof: Similar to Lemma 4. 


Theorem 8: Suppose code Cy and Cz, with generator gi(x) and g2(x), 
correct t, and tz errors, respectively, and 2t, > ti .If Ci, C Ce, 


g(x) /gi(x) = r(x) 4 (1+ 2) 
and if K(«) «Cz — C1, then the pattern 


ros {FE} 





if K(x) has even weight, or 


P(x) = et Hh, 


if K(x) has odd weight, defines a coset code of C; with sync-recovery capa- 
bility of at least one bit. 


Proof: First note that such P (7) always exists. Now with the P (x) used 
to define the coset code, the error pattern for one-bit loss in syne is 


{P(z)( +2) + 6 (e)} 
= {K(x) + &(x)} if K (a) is even 
= {K(7) +14 8 (2)} ={K(z) + 8(x)} if K(z) is odd. 


That is, the syndromes are either {K(x)} or {K (a) + 1}. By Theorem 
6, {K (x)} does not belong to a correctable coset of Ci since 2% > th. 
The coset corresponding to {K (x) + 1} has weight equal to or greater 
than 2t , by Theorem 7, so that it is not correctable. 
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Similarly, the error pattern for one-bit gain is 


{P(x) (1 + 2"7) + 2” 78 (x) 
= {x""[P(z)(1 +2) + 6(a)}} 


= {a" 7K (x) + 278 («)} if K (x) is even 
= {a""(K(v) + 1) + 27 78'(a)} 
= {2""K (x) + 2” 78 (x)} if K (x) is odd. 


That is, the syndromes are either 
{o" 7K (2) +a} or {e""K («)} 


By Theorem 6 and Lemma 1, {K’(x)} = {#”'K(x)} is not in any of 
the correctable cosets of C; , and by Theorem 7 and Lemma 1, 


{a""K (x) + 2""} = {2""(K(e) + 1)} 


is not correctable either. It follows that P («) satisfies the first condition 
that all the error patterns of one-bit slippage generated by P(a#) can 
not be in any of the correctable cosets of C, so that C, has syne-detection 
capability of at least one bit. 

By Lemmas 2, 3, 4, and 5 the cosets corresponding to bit-loss patterns 
can not be the same as the bit-gain patterns. Thus the second condition 
is satisfied. It follows that code C; has sync-recovery capability of at 
least one bit. 

The search for K(x) is very simple since go(x) € Co, go(w) ¢ Ci so 
that we may use g2(x) for K (a#) in every instance. 

Thus the procedure to find a coset code for use with Scheme B is as 
follows. 

(<) Find a code, C2, of the given code C; such that go(x)/gi(a) = 
ra) { (+ 2) and that 24 > 4. 


(21) Use 
g2(x) ae 
f + -} ee) 


to generate the desired coset code if the weight of go(2) is even and use 


go(x) +1 os 
Cres) 7 Pe) 





if g(x) has odd weight. 

From Theorem 8, it is easy to see that this procedure applies to all 
Bose-Chaudhuri-Hocquenghem codes whenever 2f > #, and many 
other algebraic codes. 
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The following example shows how to apply Theorem 8 to Bose- 
Chaudhuri-Hocquenghem codes. 

The polynomial and the associated sync-loss error syndromes P (x) 
for a (15,5) BCH triple-error-correcting code generated by 


a@)=(@+eti@teiPte i ¢2+ Ie tat) 


can be found as follows: 
If wis a root of x + 2+ 1 then a’ is a root of 2 + x + 1 (see tables 
of Marsh” or Peterson”) so that 


r(x) = (a +a+1) f (14+2) 





g(a) = an =(e+at+ti)(*¢tet+ev7?+24+1) 


generates a double error-correcting BCH code. 


Observe that 


therefore, 
2 > hh, 
hence, all the requirements for Scheme B are satisfied. Set 
K(x) = g2(&), 
and since g(a) has odd weight use 
P(e) = {K(@) 7 7 (- + : + 2 -|- 
=a tate. 
The syndromes are: 
(z) bit loss 


(Ki) +laetea’ t+a°4 2’ 
or 
{Kip =e te’ +aro tet t 1. 
(i) bit gain 
fe" K(t)p =e ter teateteti 


or 


fo" "K (a4) +a” 4} =a +ao4+a°4 2’. 
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It can be verified that all the syndromes listed do not belong to cor- 
rectable cosets of the given code, and the syndromes are obviously all 
different, so the modified (15,5) code has sync-correction capability of 
one bit. 

Notice that for this code 


dn = sh no 
therefore, 
d=d,— (+1) =7-4=83 
and 
i +1 = 4. 


The condition, d — r = ¢ + 1, is not satisfied so that Scheme A is not 
applicable. 


3.3 Implementation 
By the Euclidean division algorithm, one writes 
(1 + 2*)P(x) = g(x) F(x) + B.(2) 


where D[R,(x)] < D[g(x)] = n — k. 
From (28), the s-bit-loss syndromes are 


{o(v) + (1 + a*)P(x)} = [6@)} + {U1 + 2*)P(@) = O(@) + Rs(z) 


(Since D[é*(x)] < D[g(x)]) for allO <s Sr. 

It follows that all possible s-bit-loss syndromes have the same high 
order n — k — s terms [namely, the high order n — k — s terms of 
Fs(x)| and the remaining low order s-terms assume all possible 2* com- 
binations. Thus, the 2* possible syndromes for an s-bit-loss ean be de- 
tected by a single and gate that recognizes the high order n — k — s 
terms of R,(2). 

According to (28) and (80), s-bit-gain syndromes are the same as 
the s-bit-loss syndromes multiplied by 2”. It follows that a bit-loss 
recognition device, as mentioned above, can also be used to test bit- 
gain syndromes. This can be done by transforming bit-gain syndromes 
to bit-loss syndromes through multiplication by «*, 0 < s S r, then 
testing the resultant syndromes with the bit-loss recognition device. 
Such a device takes r and gates and is applicable to both Scheme A and 
B. 
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IV. CYCLIC CODES FOR DETECTION ONLY 


Some parts of the subject discussed in this section have been investi- 
gated in the literature.® Here, a different point of view is presented. 

If a cyclic code is used for error detection only, then any error pattern 
due to sync loss or gain can be detected so long as the erroneous words 
generated by sync loss are not in the code space. If r is the maximum 
amount of slippage possible (such that the misframed words are not 
code words), the code is usually said to possess comma-free freedom r. 
Codes having the property that |r| = [(n + 1)/2], are called “comma 
free’’. 

Consider a coset code C’ generated by P(x) and designed to detect 
sync loss or gain. To assure that such error patterns are detectable, one 
requires, by (28) and lemma 1, 


{1 + 2°)P(x) + &(«)} #0 (52) 


since @ = Q because every coset is not correctable for error-deteciing 
codes. 

As any syndrome has a degree which is at most n — k — 1, and 6°(zx) 
is an arbitrary polynomial of degree 6 — 1, it is always possible, for 
any P(x) to have 


{1 +2°)P(e)+e@)} =0 if BoOn-k—-1. 


It follows that the comma-free freedom can never exceed n — k — 1. 
Hence, we have the following theorem. 


Theorem 9: The comma-free freedom of any cyclic code cannot exceed 
= hod, 
One sees that by letting P(w) = 1; (52) now reads 


(i +a°%)-14+ 0()} ={14+e°4+ &(2)} = 2° + F(x). (53) 


The term 2° cannot be canceled by 8°(x) provided 6 S n — k — 1. 
It follows that the upper bound on comma-free freedom described by 
Theorem 9 can be met. Hence, we have the following result. 


Theorem 10: The comma-free freedom of any cyclic code can be made as 
largeasn — k — 1. 

According to Theorem 10, the comma-free freedom r cannot be 
greater than n — k — 1. Thus, if k 2 (nm — 1)/2, it is impossible to 
detect all the syne loss forn — k S$ r S k; on the other hand, if k < 
(n — 1)/2 then the interval between n — k, k does not exist. Hence, 
every slippage can be detected. It follows that 
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Corollary 1: An (n,k) cyclic code can be made comma-free if and only af 
k < (n — 1)/2. 


The above results have been proved in a different manner by J. J. 
Stiffler.® 

Clearly, for strictly error-detecting codes, if a received message is not 
a code word, no distinction can be made to decide whether the error is 
caused by additive noise or is due to sync loss. However, statistical de- 
cisions still can be made accurately by observing the number and fre- 
quency of word errors, and, if it is concluded that a sync loss has 
occurred, syne can be recovered by sliding the word frame until the 
number of errors observed abruptly reduces to a predetermined level. 
The penalty for such a process is, of course, the time delay and the 
loss of data. 


V. CONCLUSIONS 


Techniques for automatic word synchronization recovery are pre- 
sented. The techniques are useful if the slippage of word framing is not 
large, which is presumably the usual case. 

An upper bound on the synchronization recovery capability for any 
cyclic code is found. It is shown that, for recovery to be possible, the 
amount of slippage in bits, 7, cannot exceed (n — k — 1)/2. It is also 
shown that the synchronization-loss detection capability of any cyclic 
code is upper-bounded by n — k — 1 bits and furthermore, the bound 
can be met. 

For shortened cyclic codes, the technique has five valuable features: 

(4) No additional redundancy is required if the code corrects more 
than two errors, and only two additional check bits are required if the 
code corrects one or two errors. The generation of such check bits is sim- 
ple. 

(iz) The normal error-correcting ability of the code is not reduced 
when synchronization is maintained. 

(771) The correction of synchronization loss can be accomplished even 
in the presence of additive noise. 

(iv) The time delay required before proper framing is restored is 
small, usually one-word time. 

(v) The implementation is very simple. 

The technique has been successfully applied to an existing error con- 
trol unit which utilizes a triple-error-correcting (200,175) code. The net 
cost of the additional hardware is about 20 transistors. The circuit cor- 
rects at least one-bit synchronization loss and, with diminishing proba- 
bilities, corrects larger syne losses as well. 
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For cyclic codes which are not shortened, necessary and sufficient con- 
ditions for the existence of a suitable coset code without additional re- 
dundancy for the recovery of synchronization loss are derived; and a 
class of optimal codes is given. 

Two schemes are presented for finding such coset codes. The first 
scheme, called Scheme A, applies to any cyclic code whose minimum 
distance is greater than 2¢ + 2, where ¢ is the number of random errors 
the decoder actually corrects. 

Such a scheme is usually applicable to data systems with a reverse 
channel in which case high error-detecting ability is utilized to obtain 
very-high-accuracy transmission. This requires that the error-correct- 
ing ability of the code be reduced in favor of the detecting ability. That 
is to say, in such a case, the minimum distance of the code is often greater 
than 2¢ + 2. 

For systems using forward-acting error correction only, the error- 
correcting ability of the code is usually exploited fully so that the re- 
quirements of Scheme A may not be met. A special technique, called 
Scheme B, are developed for such situation. Instead of requiring d, > 
2t + 2, Scheme B requires a set of conditions which are almost always 
satisfied by Bose-Chaudhuri-Hocquenghem codes that correct more than 
two errors. Both schemes have the advantages mentioned earlier for 
shortened cyclic codes except there is no noise tolerance. 

Word synchronization loss is a catastrophic failure in error-control 
systems. The techniques herein described offer a solution to this prob- 
lem for all binary cyclic codes with negligible cost in hardware, in time 
delay, and without loss in transmitting efficiency in many cases. 
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APPENDIX 
The Verification that P(x) is Optimal 


n—1 . [¢/2] 
P(x) i > P;x' ee a gr Tt)—0 be got (54) 
i=0 


o=09 
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where 
r+ [t//2])7 +1)—-—a<n-1 (55) 
and 
oo = 1 + 2[t/2] — t. 
Now 
n—s—l 
Te(a) = 2a (Pi + Piss) O<sSr, (56) 
Consider 
{t/2] 
(1+ 2')P(2) = & prrnon 2) (1+ 2") 
o=09 
so S$ Cane ge Ce aoe) (57) 
o=09 
ais ms me ge : 
Note that: 


() cor +1)—o X¥or4+1)—o+sforal0O <s Er. 
(it) The exponent of the highest order term under the summation 
sign, [t/2](7; +1) —oo+s < [(/2]r +1) ~-o +r <n-—1by (55). 
Therefore, no terms of (57) will be canceled for allO < s S71. 
Note 7',(x) is the same as the polynomial (57) with the lower order 
terms x°, x’, --- , 2” ’ removed so that the number of nonzero terms 
(hence the weight) of 7,(x) is equal to the total number of terms of 


(57) minus the number of nonzero terms with exponent s — 1 or less. 
Case 1: t is odd. 


oo = 1+ 2/2] -t=14+2°>*- 6-0. 
There are two nonzero terms in (57) which have exponents no greater 
than s — 1, namely 27°°+”~” = 2° and x”. Therefore, the weight of 
T;(x) 


a 
22] +2) -2=2(4> 442) —2 


I 


t+ 1. 
Case 2: t is even. 


= PE Ol Sr St 2) SS. 
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There is only one nonzero term, namely, x’ that has an exponent no 
greater than s — 1. Therefore, the weight of 7, (x) 


= 2({t/2} +1) —1 = 2/241) -1=t+1. 
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Increasing the Memory Capacity of the 
Digital Light Deflector by 
“Color Coding” 


By J. G. SKINNER 
(Manuscript received December 23, 1965) 


Considerations are given to the use of a multiwavelength light source in 
the digital light deflector for the purpose of increasing the memory capacity 
of the system. An increase of a factor of twenty or more is theoretically 
possible when potassium tantalate niobate (KTN) is used for the optical 
switches in the digital light deflector. Practical considerations may, however, 
limit the increase to about a factor of ten. 


I. INTRODUCTION 


This paper describes a scheme for increasing the memory capacity 
of an optical beam deflecting device, such as a digital light deflector, 
by utilizing the full wavelength bandwidth of the system. For con- 
venience the scheme is referred to as “color coding”. 

The digital light deflector! is designed to deflect a monochromatic 
light beam to any one of an array of positions on a memory plane. The 
absence or presence of an obstruction at the memory plane produces a 
yes or no signal in a detector placed behind the memory plane. The 
capacity of the system is determined by the ratio of the maximum 
angular deflection of the beam to the smallest angular deflection that 
can be resolved by the system. Increasing the capacity by increasing 
the maximum beam deflection requires lenses that can operate over a 
larger angular field of view. On the other hand, increasing the capacity 
by reducing the minimum resolvable angle requires larger diameter 
components. Assuming the diameter of the components and the angular 
field of view of the lenses are at their maximum values, then other 
means of increasing the capacity must be investigated. 

The capacity may be increased by paralleling information through 
the system so that each resolvable angular location in space becomes a 
word. This may be accomplished by splitting the deflected mono- 
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chromatic light beam into M channels, each with its own detector. 
However, due to imperfections in certain components in the digital 
light deflector, a small portion of the incident light beam is deflected 
to several incorrect locations on the memory plane and also appears 
as a general background illumination on the memory. This extraneous 
light is undesirable because it reduces the signal-to-noise ratio of the 
correctly deflected spot of light. A method of reducing the unwanted 
light is to place a mirror behind the memory plane to reflect the light 
back through the deflector system to a detector located in a position 
conjugate to the light source. It can be shown? that this system reduces 
the extraneous light. In order to parallel information through this type 
of system it is necessary to code the word in the memory system, reunite 
the word for transmission back through the light deflector and then 
separate again for detection. This can be accomplished with a mono- 
chromatic light beam by coding a single pulse into a sequence of pulses 
by placing the different memory planes at different distances from the 
exit of the digital light deflector. This scheme,’ however, requires large 
aperture, high-precision lenses which are expensive. 

Another possibility is to transmit simultaneously several mono- 
chromatic beams of slightly different frequencies so that the beam can 
be reunited after coding and still be capable of yielding the coded word. 
The presence or absence of a reflector for a given frequency at a given 
location yields a yes or no bit of information. This is shown sche- 
matically in Fig. 1. The increase in capacity attainable with color 
coding is the ratio of the bandwidth allowed by the system to the 
bandwidth required per channel. 

There are several ways of constructing a color-coding system. One 
scheme is to place a number of narrowband reflection filters at the 
memory plane as shown in Fig. 2(a). However, although narrowband 
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Fig. 1— Schematic layout of the color-coding system. 
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Fig. 2— Different color-coding schemes. 


reflection filters are theoretically possible, they are not presently 
available. Another scheme is to use narrow-band transmission filters 
which will transmit one channel and reflect the other channels to their 
appropriate locations. This is shown schematically in Fig. 2(b). This 
has the advantage that no energy is lost in the beam-splitting assembly 
but has the disadvantage that a filter requires a larger bandwidth for a 
given angular beam deflection when placed at an angle to the beam. In 
order to keep the bandwidth per channel to a minimum, it is necessary 
to use the filters normal to the beam. This arrangement requires an 
achromatic beam splitter to divide the beam into M channels. This is 
shown schematically in Fig. 2(c). The disadvantage of this scheme is 
that it reduces the beam intensity of each channel by a factor of M?. 
The calculations presented below are for narrow-band filters placed 
normal to the beam. If transmission filters were used, it would be 
necessary to tilt the filter at a small angle to avoid the unwanted light 
being reflected back to the detector. This small angular displacement 
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of the filter has been ignored in these calculations. The calculations are 
based on the present design of the digital light deflector that employs 
potassium tantalate niobate (KTN) as the electro-optic material® for 
the light switches.* 


II. BANDWIDTH OF THE DIGITAL LIGHT DEFLECTOR 


The total bandwidth of the system is limited by the electro-optic 
elements that are used to rotate the plane of polarization of the light 
through an angle of 90°. Any error in this rotation allows light to leak 
through to the wrong locations in the memory plane. The light leakage 
is related to the phase difference Ab between the ordinary and the 
extraordinary rays at the exit face of the optical switch. Fig. 3 shows 
the parameters of the light beam relative to the optical switch. The 
value y is the angle between the beam and the axis of the optical system, 
and a is the angle the intersection of the plane of incidence of the beam 
and the x,y plane makes relative to the x axis. The value of A®é to the 
second order in ¥ is given by? 


(cos _ 5) a n? 
A@ =/]1+ —— — al(gu — gi2) P” (1) 
n rd 

where n is the ordinary index of refraction, \ is the free space wave- 

length, 7 is the length of the optical switch, gu and gz are the electro- 

optic coefficients, and P is the lattice polarization of the KTN crystal. 
With respect to the angular orientation a, the maximum change in 

A® occurs at a = 0° Under this condition (1) reduces to 


OPTICAL SWITCH-~— 





Fig. 3— Coordinate axes showing the relation of the optical switch to the in- 
cident beam. 


“COLOR CODING”? IN DIGITAL LIGHT DEFLECTORS 601 


je (gus = giz) P’. (2) 


2 
av = [1+ 2 a 


2n? 

We will assume that the optical switches are normally biased to 

A® = Nr and are switched to Af, = (N + 1)r forA = Ay andy = 0°. 

The error in A®,, due to \ and y being different from A» and 0°, respec- 
tively, is 





6 = Xe (gu = gi2) P 1 — nor 1 + On? (3) 
hon? wa 


where no and are the refractive indices at wavelengths \» and 4, 
respectively, and are given by the equation’ 


3.7994 
1 — (A,/d)? 


where A, = 0.2012 uw. The light leakage 6, due to the error in A®,, is 
given by 


Walt (5) 


6 = I/Iy = sin? (68/2). (6) 


Combining (4), (5), and (6) we can obtain a plot of the allowed band- 
width, AX = \ — Ao, for a given de bias of the optical switch, expressed 
by N, and an allowed light leakage 6. The bandwidth has been plotted 
in Fig. 4 as a function of N for values of 6 = 0.1, 0.01, and 0.001, and a 
beam angle y of 0.07 radians (i.e., 4°). This value of y is a typical value 
that may be used in the digital light deflector; however, the term y?/2n? 
is negligible except for \ = Xo. 

A typical bias point for KTN optical switches is N = 20 and an 
acceptable light leakage in the digital light deflector, due to color 
coding, is 6 = 0.01. From Fig. 4 we see that the available bandwidth 
under these conditions is only 20 A. The maximum bandwidth is at 
zero bias (i.e., N = 0) and equals 436 A for a light leakage value of 
6 = 0.01. 


III. REQUIRED BANDWIDTH PER CHANNEL 


The bandwidth required per channel depends on the type of filter 
used to separate the channels. We will consider a system using the 
Fabry-Perot type of narrow-band transmission filter’ that consists of 
two reflecting surfaces separated by a suitable dielectric material. The 


602 THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1966 





0.2 

0.4 = 

300 200 100 0 100 200 300 400 
BANDWIDTH AX (A) 





Fig. 4— Frequency bandwidth of the KTN optical switches. 


per cent transmission through such a filter depends on the wavelength 
and the angle of incidence of the beam. Since the beam will be de- 
flected through a range of angles it is necessary to specify the minimum 
permissible transmission 7’, relative to the peak transmission, for each 
channel. It is also necessary to specify the maximum transmission 7’ 
of adjacent channels. The light reaching the detector from the memory 
plane passes through the filter twice, so that the single pass trans- 
mission values will be the square root of T and T’. The various pa- 
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rameters of the transmission curve are shown in Fig. 5; the values 
marked with a prime are parameters of the adjacent channel. 
The peaks of the transmission for a Fabry-Perot filter occur at 


_ Amph cos 6 _ 


Q 
r 


2m (m = 1,2,8,---), (7) 
where yp is the refractive index of the dielectric between the two re- 
flectors, h is the thickness of the dielectric, @ is the angle the beam 
travels through the dielectric relative to the normal of the surfaces of 
the filter, and A is the free space wavelength of the beam. 

The shape of the transmission curve is given by 


are ee 
In 1+ F sin? (0/2) 


where fF = 4R/(1 — FR)? and R is the power reflection coefficient of the 
reflecting surfaces. If F is sufficiently large (i.e., high-reflectivity mirrors), 
then \/F = 4/e, where ¢ is the half bandwidth of the transmission 
curve. 

We require that I(T)/Io = VT at Q = 2mm + §; therefore, 
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Fig. 5 — Transmission curve showing the various parameters involved. 
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I(T) JF 1 i) 
__—s = T SS ee EE 
7 ae : 
° 1+ F sin’ (? me ‘) = + Fsin?(¢/2)" (9) 
Approximating sin(¢/2) with (¢/2) for small values of ¢, we obtain 
4 1 
2=— —=—j=e 


where 


t= 4/Fa-1- 


Similarly, the requirement that I(T’)/Io = ~/T at Q = 2mm + B yields 


26 = et! = 2¢t'/t (11) 
where 
’ 1 
oe 
From (7) and Fig. 5, 
2mm — ¢ = 4mrph cos Omax/d (12) 
2mm + B = 4auh cos Omax’/N’ (13) 
Q2na + & = Anph/dr. (14) 


From (12) and (14), 
2¢ = (Qma — §)(1 — COS Omax)/COS Omax - (15) 
From (12) and (13), and noting that Omax = Givaz's 


r+ p= Ome —9r(b-2), (16) 


Combining (11), (15), and (16), and expressing the bandwidth required 
between channels as (AA) = A — X’, we obtain 


(AX) = ACL — COS Omax)(t + t’)/2t COS Omax - (17) 
For small angles of @max , (17) may be written as 
(AX) = AOmax2(é + #’) /4E. (18) 


This equation shows that the required bandwidth per channel increases 
rapidly as the maximum beam deflection angle is increased. 
The half bandwidth of a filter is usually measured with the beam 
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normal to the filter (i.e., @ = 0°). Therefore, the half bandwidth (Ad), , 
in angstroms, is given by 








: 2 
EAN) NO max 


Av). = 
ONS Bog 2t 


(19) 


IV. NUMBER OF CHANNELS AVAILABLE FOR COLOR CODING 


Three values 7, 7’, and @nax must be specified before the number of 
channels available for color coding can be determined. 7 and T” are 
determined by the required signal-to-noise ratio of the over-all system 
which has not yet been specified. However, typical requirements of 
the color coding system may be that the intensity of each channel may 
vary by no more than a factor of two in any beam position and that 
the crosstalk between each channel be no more than 10 db. This requires 
that 7 = 0.5 and T’ = 0.05. 

The angle of incidence @ of the beam at the filter is related to the bit 
capacity N? of the digital light deflector. The number of resolvable bits 
in one direction is given by? 


N = KeD/r (20) 


where @ is the maximum deflection angle of the beam in one plane, D 
is the effective aperture of the digital light deflector, \ is the wave- 
length of the light beam, and K is a constant. The value of K is de- 
termined by the transverse mode of propagation of the light beam and 
the ratio of the linear bit separation in the memory plane compared to 
the theoretical limit; its value is about 0.2. Practical values are N = 512 
for 6 = 4° with an aperture diameter of about 2 cms. Present con- 
siderations of the digital light deflector have the arbitrary requirement 
that the center ray of the reflected beam from the memory plane can 
just be accepted by the aperture of the optical system when the beam 
deflection is a maximum along the z or y axis. This is shown schematically 
in Fig. 6. Therefore, the maximum angle of incidence of the light at 
the filter is the angle YW as determined by the extreme rays of the light 
cone when the beam is focused in the center of the memory plane. To a 
close approximation, ¥ equals 26. The angle of the beam inside the 
filter is reduced due to refraction at the surface of the dielectric layer 
in the filter and for small angles @nax = 20/u. The total number of 
channels M available for color coding is, therefore, 


_ B _ But 
~ (AX) O2(t’ + 8) 


where B is the bandwidth of the modulator. 


(21) 
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Fig. 6— Schematic showing the maximal angle of incidence of the beam at the 
memory plane. 


The total capacity C of the digital light deflector and the color coding 
system is 


C= NM = (KB) DH t 

hed Ady 

This relation shows that the total capacity of the system is independent 

of the maximum beam deflection angle but strongly dependent upon 
the wavelength of the beam. 

The maximum capacity of the color coding system for a beam de- 
flection angle of @ = 4°, also T = 0.5, T’ = 0.05, D = 2 cm, A = 5000 A, 
p = 2.45 (which is the refractive index for TiO, that is frequently used 
in Fabry-Perot filters), and an optical switch bandwidth of B = 436 A 
(which is the maximum value for a zero biased KTN switch and a light 
leakage of 6 = 0.01) is 27 channels. This means that the capacity of the 
digital light deflector that has N = 512 resolvable spots in each direc- 
tion could be increased from 2.6 X 10° to 7.1 X 10° bits. The half 
bandwidth of the filters for the above values is given by (19) as 12.6 A, 
which is well within the capabilities of present day technology. 

Unfortunately, when a KTN optical switch is operated with zero 
electrical bias, the required voltage and power dissipation are greater 
than can be accepted. The problem is the increase in the light leakage 
through the switch caused by the change in the operating temperature 
due to this increase in the dissipated power. A compromise may be 
possible, however, by using less than 27 color coding channels, hence 
reducing the required bandwidth, and increasing the allowed light 
leakage due to color coding. As an example, if the system has 10 color 
coding channels and the light leakage 6 is increased to 0.035, then the 
optical switches can be biased to N = 4; this reduces the dissipated 
power in the KTN crystal by about a factor of four compared with the 
unbiased case. 


(22) 
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V. CONCLUSIONS 


The calculations show that the capacity of the digital light deflector 
can be increased by a factor of twenty or more by the addition of a 
color coding scheme. The device could be constructed with present 
technology by making the optical switches of the digital light deflector 
from KTN crystals and by using narrow-band transmission filters for 
the channel selection, but it would have several practical problems. One 
problem is the necessity to operate the ITN optical switches at zero, 
or a very low, electrical bias. This would require higher operating 
voltages and greater power dissipation in the KTN crystals than is 
anticipated in the present design of the digital light deflector. Other 
electro-optic materials have been considered for use as the optical switch, 
but either the angular aperture of the material is too small, the power 
requirement is too large, or the material is not presently available in 
large enough pieces. An alternative is to use a stressed optical plate in 
which the birefringence necessary to produce the 90° rotation of the 
plane of polarization is induced by applying a mechanical stress. The 
wavelength bandwidth of a fused quartz stressed plate is approximately 
600 A when operated with a light beam with a wavelength at 5000 A. 
This bandwidth can accommodate 37 color coding channels. The dis- 
advantage of the stressed plate optical switch is that it is limited to 
switching times of the order of milliseconds compared to microseconds 
for a ITN optical switch. 

A disadvantage of using narrow-band transmission filters for channel 
selection is the larg. attenuation in the reflected beam due to the achro- 
matic beam divider. This can be overcome by the use of narrow-band 
reflection filters. 

One other item that the color coding system requires is a multiple 
frequency light source. The light beam should have high intensity, be 
highly directional, and have a frequency spread between each line 
equal to the channel separation. One possibility is to use a laser that 
oscillates at many frequencies simultaneously, such as the argon laser. 
Another possibility is to use a stimulated Raman source in which several 
frequencies are produced by down shifting the incident laser beam by 
Raman scattering from a suitable medium. 
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A Model for the Random Video Process 


By L. E. FRANKS 


(Manuscript received December 27, 1965) 


For problems concerning the transmission of video signals, it is often 
desirable to know the statistical distribution of power in the frequency 
domain for the signal process. It 1s convenient to have a model, involving 
only a few essential parameters, which will satisfactorily characterize the 
power spectral density of the random video signal. This paper proposes a 
model for the random picture and derives expressions for second-order 
statistical properties of the video signal obtained from a conventional 
scanning operation on the picture. The properties of typical picture material 
make valid certain approximations which lead to especially simple, closed- 
form expressions for power spectral density. The continuous part of the 
power spectral density is expressed as a product of three factors, charac- 
terizing separately the influence of point-to-point, line-to-line, and frame- 
to-frame correlation. For parameters representative of typical picture 
material there is observed an extreme concentration of power near multiples 
of the line scan and frame scan rates. An illustrative example of the use of 
the model in an optimum linear filtering problem is included. 


I. INTRODUCTION 


This paper provides a detailed development of a simple model for 
characterizing the statistical properties of a random video signal. The 
primary concern is the modeling of the power spectral density of the 
electrical process generated by linear, sequential scanning of a rectangu- 
lar portion of an infinite, two-dimensional random picture. The spatial 
and temporal statistical properties of typical picture material allow 
approximations which lead to a model having an especially simple form, 
characterized by only a few parameters. The model has a form convenient 
for the analysis of a variety of signal transmission problems. The validity 
of the model for these purposes is established by comparing it with 
results obtained in several independent experimental studies.1:?/3 

The relationship between the second-order statistics of the random 
picture and those of the resulting video signal due to line-to-line and 
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frame-to-frame correlation is examined in Section IJ. Section III con- 
siders the composite video signal wherein the picture signal is periodically 
interrupted and a periodic pattern is inserted for purposes of syn- 
chronization and blanking. A model of the random picture process is 
developed in Section IV. The results are combined in Section V to 
provide a summary of expressions for the power spectral density of the 
composite video signal. Section VI is an illustrative example of the use 
of the model for deriving optimum linear signal processing networks 
for video signal transmission over a noisy channel. A glossary of symbols 
is provided in Appendix A. 


Il. EFFECTS OF THE SCANNING OPERATION 


For the first step in the development of the random video signal, 
we consider a still picture with luminance given by the “stationary” 
random process, d(z,y); i.e., the two-dimensional autocovariance func- 
tion for the process can be described by 


i [d (a ’ 41) d (22 ? yo) =@ (2,8), (1) 


where a= %— 2 


I 


horizontal displacement 
B = Ye — Yr = vertical displacement. 


For convenience in derivation of the equations, we assume that 
d(x,y) is a zero-mean process. Although physically d(x,y) would be 
non-negative, it is easier to add in the mean in the final expressions. 
The video signal, v(t), at the output of an optical scanner moving at 
constant velocity across the picture is a stationary process with an 
autocorrelation function simply related to g(a,8). In order to avoid the 
introduction of unnecessary constants, assume that the scanner moves 
in a horizontal direction at unit velocity and also that output voltage 
is proportional to luminance with unit conversion gain. Then 


gi(7) = Elv@)o + 7)] = o(7,0) 
and | Elv(@)] = 0. (2) 
Actually, the scanner output is normally a nonlinear function of lumi- 
nance, however, the model developed here has the property that its 
autocovariance function is changed only by a multiplicative constant 


. : : * 
when subjected to a zero-memory, nonlinear transformation.” Hence, 
the model remains valid for any scanner characteristic and any in- 


_ * For the random process described in Section IV, the autocovariance is propor- 
tional to the variance of the first-order amplitude probability density function. 


A MODEL FOR THE RANDOM VIDEO PROCESS 611 


stantaneous companding operation to which the video signal might be 
subjected. 

The next step is to account for the effects of line-to-line correlation 
by considering the sequential scanning of a still picture in the form of 
an infinite strip with a finite horizontal width, traversed by the scanner 
in an interval of 7 seconds. Successive lines are separated in a vertical 
direction by a distance corresponding to the horizontal travel of the 
scanner in an interval of 7. seconds. The abrupt change in scanner 
position when it reaches the edge of the strip causes the video signal 
to be a nonstationary process. The autocorrelation for the process, 
y(t,r) = Elv(t)v(¢ + 7)] is periodic T in ¢. This is related to a stationary 
process in the usual manner by considering ¢ a random variable uni- 
formly distributed over the interval (0,7]. Then, 


g(r) = LEly(t7)] 


= = o(r — kT,kT.)P(k,r). (3) 


The probability, P(k,r), that the points ¢ and ¢ + 7 fall in lines & apart 
is given by the translates of the triangular function, qgr(7). 


P(k,r) = ar@ — kT), 


where 


i-—+— for |r| sT 


|r| (4) 
qr(r) T 


= 0 otherwise. 


Combining (8) and (4), the autocorrelation function for the video 
signal with line-to-line correlation taken into account becomes 


gol) = Do olr — REAL .)ar(s — ED), (5) 


For typical picture material, ¢(7'/2, 6) & 0. In this case g(r) is a 
sequence of essentially isolated pulse shapes, gr(7)yo(7,0), centered on 
integral multiples of 7’, the kth pulse from the origin having a magnitude 
proportional to ¢(0,k7'.) as indicated in Fig. 1. 

The power spectral density of this process is 


of) = [ ealre ar 
(6) 


ager yee” 
k=—00 
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where 
G(iKT.) = [ans o(rk Tear. 
If we let 
H(fy) = id ; G( foe ?"”"de 
then 


G(fkT.) = i H( fre re'dy, 


Now substituting (9) into (6) and using the identity 


ye Pa a = es a(x = m) 


k=—oo n= 


we get 


(=a LH (1, = (/ = *)) . 


(7) 


(8) 


(9) 


(10) 


(11) 


Again considering typical picture material, g(7,0) is narrow compared 
to gr(7) so a good approximation is gr(r)e(7,0) & ¢(7,0). In this case, 
H(f,v) is essentially just the double Fourier transform of the picture 


autocovariance function 


H(fv) = I oltre" drda. 


(12) 


A more significant consequence of the correlation in typical picture 
material is that &.(f) can be closely approximated by the product of 


$2(T) 


(0,0) — we A(T) $(7T,0) 





O T kT 


Vig. 1 — Autocorrelation of video signal with line-to-line correlation. 
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two functions, one periodic 1/7 and the other an envelope with rela- 
tively small variation over 1/7’ intervals. Since, for typical picture 
material, H(f,0) and H(0,f) have roughly the same width and since 


T/T. > 1, it follows that H(0,(Tf/T.)) is very narrow compared to 


H(f,0). Furthermore, since H(f,0) has relatively small variation over 
an interval of width 1/7, then (11) can be approximated by 


wine > Hel 5) 


weds = T m 
= 5 H(50) LH (0, ; (; )) . 
A power spectral density of the form indicated in (13) corresponds 
to the property of separability in g(7,c). Let 
g (7,0) = a? on(r) $v (c) (14) 


where ¢,(7) and ¢,(o) are normalized autocorrelation functions for 
scanning along horizontal and vertical lines, respectively. ¢,(0) = 
¢y(0) = 1; d? = ¢(0,0). Neglecting edge effects, from (12) we have 


H(f,v) = a? &,(f) &(v), 


and the power spectral density (11) becomes 
_ - Vi m 
wi = Lain Do (7 (7-4). (15) 


A sketch of this function is shown in Fig. 2. 
The final step in characterizing the effects of the scanning operation 


(13) 
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Fig. 2— Power spectral density of video signal with line-to-line correlation. 
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is to account for frame-to-frame correlation in repeated scanning of a 
finite portion of the infinite strip. Of course, if the picture is still, the 
resulting process is periodic. We consider a randomly moving picture 
with slow variation compared to the frame repetition rate, 1/N7T, where 
vertical scanning is accomplished by N uniformly spaced lines. The 
nonstationarity arising from the abrupt change of scanner position 
when it reaches the bottom edge of the picture is handled in the same 
manner as before in terms of a shaping function, gyr(r), whose effect 
can be neglected for typical picture material. Expressing the normalized 
correlation of the luminance of a picture point at times separated by k 
frame intervals by ¢,(kKNT); ¢:,(0) = 1, then 


els) = i(k T)pe(r — RINT). (16) 


Because of the slow variation due to motion, frame-to-frame correlation 
is high and ¢3(7) is essentially the product of ¢;(7) and a periodic repeti- 
tion of g(r) as indicated in Fig. 3. 

The power spectral density, obtained from (16) and the use of relation 
(10) is 


bi(f) = bf) So el BN) 


(17) 
= wr Pf) Pee ®; (5 - i). 
P(t) 
Seal Co, $4 (7) 
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Fig. 3 — Autocorrelation of video signal with frame-to-frame correlation. 
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Combining (17) and (15), the final expression for ®3(f) is given as the 
product of three functions; an envelope G,(f) representing horizontal 
picture correlation, a function G,(f), periodic 1/7’, representing vertical 
picture correlation, and a function G,(f), periodic 1/NT7, representing 
frame-to-frame correlation. 


oil | ed % (P62) | 


Lye De (7 = vr) | (18) 
Gil f) Go (f) Gif). 


As indicated in Fig. 4, the factors G,(f) and G:(f) impose a “fine struc- 
ture’ on ®3(f). In considering various smoothed versions of power 
spectral density, it is helpful to note that the average values of G,(f) 
and G,(f) are both unity. 

Some practical scanning operations involve line interlacing. For the 
conventional 2:1 interlace scan, the resulting modification of (18) is 
simple. Since consecutive lines are now twice as far apart, the factor 
G.(f) is modified by replacing T, with 27. This modification results 
in the individual peaks in G,(f) centered at multiples of 1/7’ being 
broadened to twice their original width. Since the picture is scanned 
vertically every NT'/2 seconds, the G,(f) factor is modified by replacing 
N by N/2. This effects a suppression of the terms centered at odd 


&;(f) 


®,(F) 





Fig. 4— Power spectral density of video signal with frame-to-frame correlation. 
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multiples of 1/NT. This latter modification is approximate since consecu- 
tive vertical scans are not exactly in register, however the ratio of power 
at odd multiples of 1/NT to power at even multiples of 1/NT is given 
by the ratio of 1 — ¢,(T.) to 1 + ¢,(f.) which for typical picture 
material may be less than 0.01. Derivation of the modifications for more 
complicated interlacing arrangements is straightforward. 


III. COMPOSITE VIDEO SIGNAL 


It is common practice to interrupt the video signal after each line 
scan and frame scan for the purpose of inserting control signals such as 
synchronizing and blanking pulses. The resulting signal is referred to as 
the composite video signal and the following development shows the 
form of the power spectral density. In order to simplify the argument, 
consider the following composite signal, 2(¢), which is a random process 
interrupted every 7’ seconds for a duration of a7’ seconds with an 
arbitrary periodic pattern, w(¢), inserted in the blank interval: 


z(t) = p(t) v(t) + wl) (19) 


where p(t) is periodic 7, equal to zero in the blank interval and equal 
to one in the video interval; w(t) is periodic 7’ and equal to zero in the 
video interval; and v(é) is a zero-mean random process with autocorrela- 
tion, g(r). The process p(t) v(t) is, of course, nonstationary but after 
averaging its autocorrelation function, y(t,7), over the period T we have 


a(r) = Elyt7)) 
g(r) Pr [tf and ¢ + 7 in video region] 


I 


ll 


(20) 


(1 — adel) 3) ga-or(t — KI) 


where 


[=| 


Qa-a)r(7) = 1 


= 0 otherwise. 


From (20), the autocorrelation function of the periodically blanked 
process is obtained by multiplying the autocorrelation of the original 
process by a periodic function having a shape as indicated in Fig. 5. 
Note that no periodic components are generated by blanking. For 
moderately small @ and typical video signal autocorrelation functions, 
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_ Fig. 5— Periodic shaping for autocorrelation function of periodically blanked 
video process. 


the effect of multiplying by the periodic shaping function is essentially 
the same as multiplying by the constant, 1 — a. The power in the com- 
posite signal is the sum of the power in the blanked process and the 
power in the added periodic signal since p(é)v(é)w(t) = 0. Hence, the 
power spectral density for the composite signal is 


@,(f) = (1 — a)®(f) + 8 | wi |? 6 (r = 4 (21) 


where the w; are the Fourier coefficients for w(¢). Modification of (21) 
for the actual control signal which consists of both horizontal and 
vertical synchronizing and blanking pulses is straightforward. In this 
case, w(t) is replaced by a signal periodic N7' (or NT'/2 for 2:1 interlace). 
The constant 1 — a is still just the relative amount of time devoted to 
the video signal. 


IV. MODEL FOR RANDOM PICTURE 


To complete the model for the random video signal it remains to 
characterize the luminance process, d(z,y), in such a manner that a 
useful expression for its autocovariance, g(a,8), can be derived. The 
discussion in Section II indicated the validity of the separable form 
(14) for ¢(a,8). Assuming separability, we need only model the one- 
dimensional process resulting from a unit velocity, linear scanning of 
the picture. Assume that a realization of this process is a piecewise- 
constant function, v(¢), which takes on the value v, over the interval, 
tr St < try: as shown in Fig. 6. The occurrence of the sequence {t,} 
of points along the ¢-axis is a stationary random process and the sequence 
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Fig. 6 — Random video signal. 


{v,} of random variables is also stationary. Then the autocorrelation 
function for v(¢) is 


g(r) = a? > TmP (mt) (22) 
m=0 
where im = E [vntn+ml 


@=E,’]; Ev.) =0 


and P(m,r) is the probability that the points ¢ and ¢ + 7 are in intervals 
m apart, 1.e., that m points of the {t,} sequence lie between them. 

The simplest model is constructed by assuming that the {v,} are 
statistically independent and that the {t,} are generated by a Poisson 
process with rate parameter, \. This model is the random step function 
discussed by Laning and Battin.’ For this case, (22) reduces to 


o(r) = d? P(0,r) (23) 
with 


P( mr) = ALED" ve 


hence, 
g(r) = der", (24) 


An obvious step in the generalization of this model is to consider 
correlation in the {v,} sequence. Suppose {v,} is a stationary, wide-sense 
Markoff sequence.” Then it has the property that 


oe ae (25) 


where 1; is the correlation between adjacent elements of the sequence. 
In this case, 
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(j2e> Sa a ioe 
m=0 m: 


(26) 
= @exp[—(1 — n)d[7], 

hence, this model is equivalent to the previous one (24) with replaced 

by (1 — 7)d. This result suggests an interesting alternate formulation 

of the model. Suppose that the {t,} sequence is uniformly spaced with 


separation 7’, (stationarity is accomplished by randomizing the phase 
of the sequence). For this case, 


P(m,r) = qr, (7 an mT.) + dr, (7 + mT.) (27) 


where 


po! ae epee. 


e€ 


Qr,(T) 


= 0 otherwise. 


Let {vn} be a stationary, wide-sense Markoff sequence with correlation 
p between adjacent elements, then 


i) 


o(t) = FX ol andr — 17.) 
In p 
Te 


This is a polygonal approximation to the exponential function which, 
since correlation between points 7, apart is typically very large, is a 
very close approximation. 

Using the preceding models, which are all equivalent, the random 
picture is characterized by an autocovariance 


yg (7,0) = qd? exp (—dn | 7 | — r|o|) (29) 


which depends only on the variance, d?, of luminance and two param- 
eters \, and \, which specify the average number of statistically inde- 
pendent luminance levels in a unit distance along the horizontal and 
vertical, respectively. Alternatively, the correlation is characterized 
by the parameters p, = exp [—A,7'.| and p, = exp [—),7'.] which are 
the correlation coefficients of luminance in adjacent picture elements 
when the picture area is quantized into small squares of dimension 7’, . 

The suitability of the exponential correlation function for modeling 
the random picture can be established by examining the results of 





~@exp (-—X|7r|)  whereX = — (28) 
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. 1 2 
correlation measurements reported by IKretzmer and O’Neal” and 
3 : . 3 
power spectral density measurements by Deriugin. 


V. SUMMARY 


Combining the results of the preceding sections, simple closed-form 
expressions can be written for the power spectral density of the com- 
posite video signal. At this point the mean value d of the luminance is 
included so the variance of the luminance becomes d? — d’. The syn- 
chronizing and blanking signals are assumed to occupy a fraction a 
of the total time and to form a pattern periodic NT. Let w; be the Fourier 
coefficients of the periodic signal added to d in the blank intervals. 
Then the power spectral density for the composite signal is 


Sf) = 1 — a)Gi( Na (Nas) 


20 . 30 
+ Dole Po(y— gp) + #0, 30) 


Using the exponential correlation functions of Section IV and (18), 
we have 


2dr 
(Qaf)? + i? 


The factor indicating shaping due to line-to-line correlation becomes 


Gif) = (d? — d’) (31) 


1 < 2d» 
G,(f) = T. eas scat. Na (32) 
Ew (7 a T) | + 2? 


The closed form for this expression is obtained by noting that G, (f) 
is the convolution product, 


vi Zhe 1 ™m 
QO =F, OTE PE B® 

Using the identity (10) and: performing the indicated convolution, 

G,(f) is expressed as a geometric series which can be summed to give 


sinh 7’.A, 


Gol) =. cosh 7'.A, — cos 2rT'f ’ 


(34) 
A similar expression for the G,(f) factor can be obtained by assuming 
that luminance of a point at successive frames forms a wide-sense 
Markoff sequence with frame-to-frame correlation p, = exp [—A,NT']. 
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Then 


sinh NT); 
cosh NT), — cos 2rNTf 


Frame-to-frame correlation, p;, has been measured experimentally’ 
and found to lie between 0.86 and 0.80 for typical material. Using these 
values in (35) the width (between.—3-db points) of the peaks in G:(f) 
is only about 0.048 to 0.071 of the separation, 1/N7’, between the peaks. 

In applications where the structure of G;(f) is too fine to resolve, 
the smoothed version, (1 — a)G,(f)G,(f), of the continuous part of 
the power spectral density is of interest. This quantity is shown in I'ig. 
7 for two different types of video signal; one the standard 525-line, 30- 
frame per second broadcast television signal (BCTV) and the other the 
275-line, 30-frame per second, initial design of the Bell System station- 
to-station Picturephone® service (PP).:'For comparison, it is assumed 
that both pictures have the same correlation, p = 0.9, between picture 
elements of dimension 7’, in both horizontal and vertical directions. 
Also both signals are obtained from 2:1 interlaced scanning. The differ- 
ence in the two curves in Fig. 7 is due to different values of the quantity, 
T./T. T./T is a fundamental parameter of the raster design depending 


Gf) = (35) 




















RELATIVE POWER DENSITY IN DECIBELS 

















Fig. 7 — Continuous part of power spectral density for two typical video signals 
with frame rate structure smoothed out. (p = ps = py = 0.9.) 
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on the number of lines per frame, the aspect ratio of the visible portion 
of the picture, and the relative size of the blank portions of the hori- 
zontal and vertical scans. For BCTV, 7./T = 0.00128; and for PP, 
T./T = 0.0041. 

The value of the parameter p used in Tig. 7 represents a highly detailed 
random picture. A typical head-and-shoulders view of a person may 
have p = 0.99 and p = 0.98 represents a moderately detailed picture.’ 
Even for p = 0.9 the power is extremely concentrated around multiples 
of the line scan rate, 1/7. The width of the peaks between —3-db points 
and also the ratio between successive maxima and minima in the power 
density are shown in Table I for various values of p and AT... 


VI. APPLICATION 


One obvious application of the model is in problems concerning mini- 
mum mean-squared error filtering of the video signal in noise. Solution 
of these problems invariably requires a knowledge of power spectral 
densities of the signal and noise. The concept of utilizing the inherent 
redundancy in the video signal to ease transmission requirements is 
familiar. Alternative to seeking coding arrangements which reduce band- 
width requirements, we can consider linear processing operations which 
utilize the highly nonuniform nature of the power spectral density to 
effect. a reduction of signal power needed for a given performance. 

As an example of this approach, consider the design of optimum pre- 
emphasis and de-emphasis filters for transmission over a noisy channel 
as shown in Fig. 8. Assume that the channel has a constraint on maxi- 
mum signal power and that it is desired to minimize mean-squared error 
in the received signal, y(é). 

Details of the derivation of the optimum filtering characteristics 


TABLE I — WiptH anp HEIGHT oF PowER DENSITY CONCENTRATIONS 
Axsout MULTIPLES OF THE LINE SCANNING RATE FOR 2:1 
INTERLACED SCANNING 











idth — S(n/T 
p Me=—Inp —|_ pridth etween 73 db | 10 logie ga 
0.99 0.010 0.00636 40.0 db. 
0.98 0.020 0.0127 34.0 
0.97 0.030 0.0191 30.4 
0.95 0.051 0.0326 25.8 
0.90 0.105 0.0668 19.6 
0.85 0.163 0.1035 15.8 
0.80 0.223 0.1420 13.2 
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CHANNEL 


oF 
G(F (2) F(f 
POWER SPECTRAL | E+ Gexnt+h ez 


DENSITY, S(f) 


NOISE, z(t) 
POWER SPECTRAL DENSITY, N (f) 


oO, 
Ps= sf) |G (A) |? df = SIGNAL POWER ON CHANNEL 
-0o 


Fig. 8 — Spectrum shaping for transmission over noisy channel. 


are presented in Appendix B. For high signal-to-noise ratio on the 
channel, the two filters are essentially inverse and we get* 





Sif) } 
F(f) = @“(f) = E | 36 
(N= OW =| ie (36) 
where the constant p is adjusted to meet the signal power constraint. 
+_ 1 ; 
=p [ (sw)! af (37) 


The advantage gained by using the filter networks can be expressed as 
a signal-to-noise ratio improvement factor y which is simply the ratio 
of signal-to-noise ratios at the output of the receiver with and without 
filtering. 


_ __ fSdf [Ndf 
Y= TSF af [NF*dt (38) 





When the optimum filter pair (86) is used, (88) becomes 


_ SSdf [Naf 
Yopt = [FCSN taf)” (39) 


The expression for yop provides another interesting measure of the 
nonuniformity of S(f). In a function space representation, the quantity, 
cos! (1/Yopt)’, is conventionally interpreted as the angle between the 
functions S'(f) and N id). If we consider flat noise over a band W 
and zero outside, then yop_ becomes a comparison of S?(f) with a con- 


* The expressions for S(f) use only the continuous part of the expression. The dis- 
crete componenets should not be transmitted in this problem. 


624 THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1966 


stant over the band of interest. In this case, 
2W J Sdf 


Yort = Wes 2+ 
Loa 
-Ww 


Values of Yopt in (40) are plotted in Fig. 9 for the case, W = 60/T 
and 7'./T = 0.0041; these parameters corresponding to the Picture- 
phone video signal. 

The form of the optimum filter, in this case F(f) = S‘(f), suggests 
a rather difficult realization problem. Accordingly, it is interesting to 
evaluate the performance of a suboptimum filter pair having the simple 
form shown in Fig. 10. The parameter, a, in the network realizing the 
periodic part of the transfer function is adjusted to match the maxima 
and minima in S*(f). The values of y using this filter pair are also shown 
on Fig. 9 and are seen to be remarkably close to the optimum values. 
It is of interest to note that part of the pre-emphasis filter effects a 
smoothing of the transmitted power spectral density by transmitting 
only the partial difference, x(t) — ax(t — T), between successive lines. 
This technique has been discussed by Harrison’ and O’Neal” with regard 


(40) 


(3 
slo 


SIGNAL-TO-NOISE RATIO IMPROVEMENT IN DECIBELS 





SCE 


= 
a 


fo) 
0) 0.025 0.050 0.075 0100 0425 O4150 0.175 0,200 
ATe 


Fig. 9 — Performance of spectrum shaping filters. (W = 60/7’, T./T = 0.0041.) 
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=i 
|F (#)|? =[(emt)? +)? . [i+a2-2a cos amt] 


(b) 


Fig. 10 —Suboptimum filter pair; (a) pre-emphasis network, (b) de-emphasis 
network. 


to the use of “‘previous line’’ linear prediction to reduce the correlation 
present in the transmitted signal. 

In actual practice, a substantial portion of the indicated advantage 
may not be realizable. This is because the received noise, in passing 
through F(f), is concentrated at multiples of 1/T causing a line-to-line 
correlation which is subjectively more annoying than the same amount 
of flat noise power. This subjective effect has not yet been fully evalu- 
ated. If it can be described by a frequency domain weighting function, 
then the methods presented in Appendix B can be easily adapted to 
give a more accurate evaluation of optimum filtering. 


APPENDIX A 


Glossary of Symbols 


d (x,y) luminance at the point (x,y) on the picture. 
¢ (a,B ) autocorrelation of picture luminance. 
gi(r) autocorrelation of video signal obtained from linear 


horizontal scanning of infinite picture. 
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go(r) autocorrelation of video signal obtained by se- 
quential horizontal scanning of infinite vertical 
strip. 

¢3(7) autocorrelation of video signal obtained by re- 
peated scanning of rectangular portion of moving 
picture. 

gn(t), Gv (7) normalized versions of g(7,0) and ¢(0,7), respec- 
tively. 

gr(r) normalized autocorrelation of luminance at a 
point as a function of time. 

Gi(f), G(f), Gi(f) factors of the power spectral density characterizing 
point-to-point, line-to-line, and frame-to-frame 
correlation, respectively. 


S(f) power spectral density of composite video signal. 

w(t) periodic part of composite video signal less the 
average luminance, d. 

W1 lth Fourier coefficient of w(t). 

a relative amount of time occupied by non-video 
portion of the signal. 

z line scan interval in seconds. 

T. time interval equivalent to distance between 
adjacent lines at scanner velocity. 

N number of lines per frame. 

Nice Xe Poisson rate parameter describing luminance 
process in horizontal and vertical directions, 
respectively. 

Ph y Po correlation between luminance values in adjacent 


square picture elements, of dimension 7’, in 
horizontal and vertical directions, respectively. 


APPENDIX B 


Optimum Filtering of Random Video Signals 


The criterion for optimum performance is expressed in terms of mean- 
squared deviation between the received signal, y(¢), and the transmitted 
signal, x(t), shown in Fig. 8. The received signal is decomposed into 
distorted signal component, u(é) = FG-x(t), and noise component, 
v(t) = F-2(t). 

Let 


I= El[(z— y)' 


(41) 
= E[(z — u)] + El] — 2E[@ — uv. 
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The last term in (41) vanishes by assuming statistical independence of 
signal and noise and zero-mean for the noise. The two remaining func- 
tionals are expressed in the frequency domain as 


r=fsii-rnemrat+ fvolrnra. (2) 


We want to find the filter transfer functions F(f) and G(f) such that I 
is minimized subject to the constraint on signal power on the channel. 


[slew Pap=P.. (43) 


Problems of this type appear to have been first discussed by Costas.° 
It can be shown that if the signal-to-noise ratio on the channel is moder- 
ately high, then F(f) and G(f) are essentially inverse. Accordingly, 
we add the constraint, G"(f) = F(f), which makes the first term in 
(42) vanish, and find the stationary points of J + uP, with respect to I’. 


1+uP.= [NF atu f sea (44) 


where we assume F to be a real function (since its phase does not affect 
signal power or noise power) with the understanding that the pre- 
emphasis filter can have an arbitrary phase shift since the de-emphasis 
filter has the complementary phase shift. In order that the first variation 
of the functional in (44) vanish, it is necessary that 


pe us ; 
r= (#8) a 


where 


¢. 4 
w= 5 | (sw) af 


in order to meet the constraint on signal power. Substituting (45) into 
(44) the minimum mean-squared interference becomes 


Imin = yl | | (SN)} af). (46) 


Because of the constraint, G’ = F’, it makes sense to speak of signal-to- 
noise ratio at the output of the receiver. The improvement in signal-to- 
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noise ratio by choosing F according to (45) relative to = 1 is expressed 
as 


[sa] wa 


[ f eswy' ar 


If we assume that N(f) is constant over the band | f| < W and zero 
elsewhere, then (47) becomes 


Yopt = (47) 


aw | Saf 


Yort = Bay 
| S’ ir] 
—W 


Now let S(f), as indicated in Section V with \, = \, = A, be given by 


sf) = K| 2) | sinh AT, 
)) = (Q2rf)” + ° | Lcosh AT, — cos 2r7'f 


(48) 


and since 
[ si af = 9(0) = 
we have 


_— Clases lene Se ‘ah. (4 
Yopt ™ _w (Qn f)” + ” _cosh AT. — cos 2rTf J ie 


Integrals of this type, having an integrand A(f) B(f) where B(f) is 
periodic 1/T and A(f) is a slowly changing envelope function, can be 
closely approximated by 


[anpnae S(t) [ sna 60 


=r[ anal BO) at 


Accordingly, we evaluate 


vr 2h (2a)? 4 Ww 
Clase) e~ oy AO ae 


IV 


1 
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and using a suitable change of variable on the second integral 


i sinh AT’, : g aA 7 AT. Se dy 
1jr cosh AT, — cos 2r7f f= ae. (1 — k sin’ g} 
where 
1.03 AT. 
k 2° 





This last integral is recognized as the complete elliptic integral of the 
first kind, K (sin™ k), which can be obtained from tables. Now com- 
bining these results, 


Ww 2 2 
| Ss a) -8 le ATe FT In alll x” (sin k) (51) 
— W 





and (49) becomes 
x W 
An 


tanh sei "| In | K?(sin7! k) 





Venlad = = (52) 





This function is plotted in Fig. 9 over the range of typical values of X. 

For the suboptimum filters shown in Fig. 10, we want to evaluate y 
in (88) for the flat noise case. The integrals are evaluated using the 
approximation indicated in (50). 





2 
VOSS nl mSiZLe (53) 
—_ alm at [1 + a? — 2ae7] 
In (53) the parameter a is selected to satisfy 
(1 — a/1 + a)’ = tanh XT ,/2. (54) 


This choice of a makes the ratio of maxima to minima in the periodic 
part of the transfer function equal to that of the optimum filter. Values 
of y(A) from (53) with 7, replaced by 27, for 2:1 interlace scanning are 
also shown on Fig. 9. 
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Theory of Cascaded Structures: Lossless 
Transmission Lines 


By B. K. KINARIWALA 
(Manuscript received January 12, 1966) 


Cascaded structures play a major role in many signal processing and 
signal propagating systems. The universality of such structures ts particu- 
larly evident when the signals are of a wave nature, 7.e., the components of 
the structure are representable by transmission lines rather than lumped 
elements. Transmission media with discontinuities are examples of such 
structures. Other examples include integrated, microwave, and optical circutts. 

Theory of distributed structures has, so far, been successfully developed 
only for structures whose components are lossless (or RC) transmission lines 
of equal electrical lengths. It is the purpose of this paper to present a theory 
of cascaded structures when the component elements are lossless transmission 
lines of arbitrary electrical lengths. Extensions of the theory developed here 
to other structures will be discussed in a subsequent paper. 


I. INTRODUCTION 


1.1 Purpose 


A large class of signal processing and signal propagating systems 
takes the form of a cascade of elementary two-port, linear transducers. 
For example, in the classical filter theory cascades of constant-k, m- 
derived sections, etc. and in the modern network synthesis cascades of 
transmission-zero sections form the conceptual basis. Integrated circuits 
utilize RC transmission lines in cascade. In microwave filter theory, the 
structure takes the form of a cascade of quarter-wave transformers. 
Optical filters incorporate the same idea in multilayer dielectric thin-film 
structures. In propagation problems, one typically encounters waves 
(electromagnetic, acoustic, etc.) travelling in cascades of transmission 
media and discontinuities. These are but a few examples to indicate the 
importance and universality of such structures. 

It is the purpose of this paper to present a theory of such structures 
when the component two-ports are representable by uniform lossless 


631 


632 THE BELL SYSTEM TECHNICAL JOURNAL, APRIL 1966 


transmission lines of arbitrary electrical lengths. Extensions of the 
theory to include other structures as well as lumped network elements 
will be discussed in a subsequent paper. A secondary aim of this paper is 
to incorporate into the theory those algorithms for analysis and synthesis 
that are most appropriate for computing purposes. 

The distinguishing feature of this study is the novel formulation for 
the transmission matrix specifying each transmission line. The total 
signal quantities at input and output of the line are related to each other 
in terms of the forward and backward travelling waves in the line. In the 
past, results have been obtained only for those structures in which the 
component transmission lines are of equal electrical length. The new 
formulation presented here leads to a complete theory of lossless lines in 
cascade. The analysis and synthesis algorithms obtained here are par- 
ticularly simple and straightforward. They appear to be quite promising 
for computation. 


1.2 Background 


In the theory of lumped networks, extensive literature exists on cas- 
caded (lumped) structures. The difficulty arises when some or all of the 
component two-ports consist of distributed elements. In the case of 
lumped elements, the system functions are defined by rational functions 
of the complex frequency variable for which there exist many well- 
known mathematical results. When distributed elements are present, 
the system functions involve transcedental functions of the complex 
variable with a consequent increase in complexity. It has been possible 
in the past to obtain significant results only for certain classes of trans- 
mission line structures by applications of Richards’ transformation. 
In particular, Richards! showed that distributed structures consisting 
only of uniform, lossless transmission lines of equal electrical lengths are 
equivalent, under a change of variable, to lumped networks. Many 
techniques and results of the lumped network theory can thus be carried 
over to such a class of distributed structures. Ozaki and Ishii? applied 
such a transformation to obtain physical realizability conditions for 
such (i.e., uniform, lossless, and equal electrical lengths) transmission 
lines in cascade. The same results have been better formulated and 
extended by Riblet.? An interesting root-locus approach has been used 
by Seidel‘ to derive the realizability of insertion loss functions. Finally, 
Shih has recently used the same idea to obtain some results in the time 
domain. All of these results are obviously directly applicable to cascades 
of RC transmission lines of equal electrical lengths again by a simple 
change of variable. 
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1.3 Results 


An entirely new formulation in terms of the forward and backward 
waves in the component transmission lines of arbitrary lengths is de- 
veloped in this paper. Such a formulation is then used to obtain several 
significant results. Specifically, these results include: 

(z) A method of analysis which allows one to write down, by inspection, 
the system functions of the cascaded structures. The expressions for 
these functions are obtained explicitly in terms of the physical parameters 
(characteristic impedances, propagation constants, etc.) of the com- 
ponent lines. 

(27) Physical realizability conditions for system functions of cascaded 
transmission lines. 

(zit) A synthesis method which is simple and appears to have the 
distinction of minimizing computational errors. 


1.4 Organization 


We begin with a statement of the problem in complete generality but 
we end up with restricting it to the case of interest here (i.e., cascades 
of uniform, lossless transmission lines). A summary of results for the 
equal length case follows. We then proceed to introduce our new formu- 
lation and discuss the lossless case in detail. The ideas developed for the 
lossless case will be extended to other structures in a subsequent paper. 


Il, STATEMENT OF PROBLEM 


In its completely general form, a cascade of linear two-ports may be 
represented as in Fig. 1. Each component two-port may be characterized 
by any one of numerous relationships between the various signal param- 
eters at the two ports. The most convenient one for a cascade structure 
relates all the signal parameters at one port to those at the other. The 
signal parameters that we shall use are the voltages and the currents at 
the several ports. Other parameters (such as forward and backward 
waves and many others) can also be used; but, these are not so con- 





Fig. 1— The cascaded structure. 
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venient. The conventions of positive directions for voltages and currents 
are also shown in the figure. 

The input and output signal parameters for each two-port are then 
related by a transmission matrix for that two-port. Thus, 


Sra = TS. (1) 


where S; is the signal vector whose elements are {V; , x} and 7’, is the 
transmission matrix for the kth two-port. It follows that 


So = T1T. +++ TraSn = TSnr (2) 
and 
T = T1T.+++ Th. (3) 


Equation (3) allows us to study the properties of the composite trans- 
mission matrix 7’ in terms of those of the component matrices 7’, . Our 
interest is in the methods of analysis and synthesis of such structures. 
These are carried out conveniently in terms of some scalar system func- 
tion of the complex frequency variable s = o + jw. The system func- 
tions that we shall be concerned with are the impedance function 


Vo(s) 
Tats) ? (4) 


and the transmission (or insertion) loss function 


Zo(s) = 


az Ve(s) Rr 
8(8) = arty W Re ve 
where Vg is the voltage of the source and appropriate resistive source 
and load terminations (Rg and R,) are assumed. The above functions 
are simply related to the elements ¢;;(s) of the matrix 7. 


tuk, + te 
Zo so eee ae 
(s) tale, + toe (6) 


and 


tuftt + te + tarlcRr, + tolte ; 
2VRekt 


In this paper, our interest is limited primarily to those structures 
that are representable as cascades of uniform lossless transmission. lines. 
The component two-ports are thus lossless transmission lines whose 
ends are the ports (Fig. 2). The component matrix 7; can now be ob- 
tained from the transmission line equations which, under zero initial 


O(s) = (7) 
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— 
Ve-if {Vi 


Tig. 2— A single section of lossless transmission line. 


conditions and Laplace transformation with respect to the time variable, 


are: 
Vis) 0 —sl, V(s) 
: = (8) 
I (s) = sCh 0 I (s) 
where V and J are the derivatives with respect to the distance variable 
x. For the kth line, it follows that 


Vi-1(s) cosh sr; R, sinh sr; | [ Vi(s) (9) 
I,-1(s) R,° sinh sr; cosh s7; I,(s) 
where L;, and C’, are the inductance and capacitance per unit length of 
the line, 


Tk = VLC ie 
Re = WL;/C, = characteristic impedance, 
and 2; = physical length of the kth line. We thus have 
| cosh sr; FR, sinh " 


—-l > 
R, sinh sr; cosh sr, 


electrical length, 


T, = (10) 


We shall use (10) to derive most of our results. 


III, EQUAL ELECTRICAL LENGTHS (LOSSLESS) 


In this section, we briefly summarize the known results that have been 
obtained for the case of transmission lines of equal electrical lengths, i-e., 


Te UF for all k. (11) 


Actually, 7 has the dimension of time and it is the time of propagation in 
each line. It is commonly expressed as a fraction of the wavelength, 
hence it is called the electrical length.* 


*In the sequel, it will simply be called “the length”; when physical length is 
meant, it will be so specified. 
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The transmission matrix 7’, is now dependent only on the parameter 
R,, . Now, it is clear from (6) that any factor common to all ¢;; cancels 
out in the expression for Z, . If we make all matrices 7’, rational in some 
variable, except for a scalar multiplier, then the function Z, will also be 
rational. From (10) and (11), it is apparent that cither the hyperbolic 
cosine or sine is the scalar multiplier if we use the transformation 


p = tanh sz, 
or 
= coth sr. (12) 


The matrices 7’, are then all rational in p (except, of course, for the scalar 
multipliers) and so, Z, will also be a rational function in p. 

It should be observed that (12) maps the real and imaginary axes into 
the real and imaginary axes, respectively, and the right half-plane into 
the right half-plane. It is this fact together with the rational Z, that al- 
lows us to draw upon the theory of lumped networks. We summarize 
some of the important conclusions. First, we choose p = coth s7 and 
observe that 


— pZi(p) + Re = a 
Zr-1(p) = Ri Z,(p) + p’ k= 1,2, yr (13) 


where Z,(p) is the impedance 


Vi.(s) 


Zx(8) = Ti(s) 


(14) 





under the above change of variables. 
A basic theorem for the physical realizability of cascaded lossless equal 
length lines is as follows. 


Theorem: The necessary and sufficient conditions that Z, , a real rational 
function of p of degree n be the input impedance of cascaded lossless equal- 
length lines terminated in a resistor are: (i) Z. is a positive-real function 
of p, and (it) even part of Z, has only the n-fold zeros at p = +1. 

The necessity of condition (7) follows from (13) by observing that the 
real part of Zz is non-negative for all values of p with non-negative 
real parts whenever the real part of Z; is also non-negative for those 
values of p. By iteration of (13) the first condition is seen to follow. The 
second condition follows from the determinant of 7’, which is (p* — 1) 
if we neglect the scalar multiplier sinh sz;. From (8), the determinant 
of T is (p’ — 1)”, neglecting the scalar multiplier again. But the deter- 
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minant of (10), is given by the difference of products of the even and 
the odd polynomials in the numerator and the denominator of the im- 
pedance function Z, . This difference is also the numerator of the even 
part of Z, and the second condition is seen to be necessary. Sufficiency 
of these conditions can be shown by an actual constructive synthesis 
procedure using the inverse relationship of (13), v7z., 


pLr-r — Ry 


LZ, = ———_ 
: p— RetZra 


(15) 


From (15), one can show that if Z,_1 satisfies the above conditions, then 
so does Z;, and it is of a lower degree. The process ultimately terminates 
yielding the load resistance. 

Other results in the p-domain include explicit expressions for the 
coefficients of the input impedance im terms of the characteristic im- 
pedances of the lines and vice versa.® There is also some discussion on the 
realizability of the transmission loss functions.? These results follow 
from the basic theorem above. 

An interesting departure from the above is the time domain investiga- 
tion of the same structure.” No physical realizability conditions are 
available in the time domain; however, the synthesis procedure is con- 
ceptually quite simple. The system function used is the (impulse) reflec- 
tion function in the time domain which takes the form of an infinite 
series of equally spaced impulses. The first impulse at ¢ = 0 can only result 
from the first discontinuity thereby yielding Ri. The second impulse 
(t = 27) results from the second discontinuity and yields Ry» (since we 
know &,). The third impulse (¢ = 47) results from the third discontinuity 
as well as multiple reflections encountering the first and second discon- 
tinuities. Since the only unknown in all these discontinuities is the third 
one, it is uniquely determined and yields R;. The process continues and 
every new impulse determines the next characteristic impedance until 
all the junctions are specified. The rest of the impulses are then sums of 
the multiple reflections from all the junctions and the synthesis is com- 
plete. 

The major drawback of the time domain approach is that there are 
no concise physical realizability conditions available. 


IV. LOSSLESS CASE (GENERAL) 


In this section, we consider the general case of lossless transmission 
lines of arbitrary lengths in cascade. The transformation (12) no longer 
reduces the system functions into rational functions. In fact, it is no 
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longer possible to think in terms of rational functions. We must, there- 
fore, abandon the previous approach and start fresh. 

We begin with some physical observations. The structure is certainly 
passive and so the impedance function Z, must be a positive-real func- 
tion of s. The component two-ports as well as the cascade of them repre- 
sent reciprocal structures and so the determinant of 7, as well as that. of 
T must be unity. This follows from the reciprocal property in general 
and can be verified from (3) and (10) directly. The next observation 
stems from the time delay property of transmission lines. As mentioned 
earlier, the length of the transmission line 7, represents in reality a time 
delay of 7, seconds between the input and output signals for the kth 
line. The cascade structure, of course, distorts the signal but we can still 
speak of the time delay as the time interval between the start of the 
input signal and that of the output signal. This is the time delay that an 
impulse will undergo, v72., 


Me 


Tk (16) 


T= 
k 


tt 
— 


In this same cascade structure, however, each component line may be 
viewed as a delay line of length 7, . To bring the parameter 7; in promi- 
nence, we can look upon the line with its discontinuities at the two ends 
as a spatial resonator for an impulse. If we can make these elementary 
resonators 7; explicitly apparent in the system functions, we would be 
able to identify the several lines. It is this fact that motivates the formu- 
lation that we shall pursue. 


Let 
2=¢ (17) 
so that 
Bee, (18) 
This maps the left half s-plane into the unit disc whose boundary 
| z| = 1 corresponds to the imaginary axis of the s-plane. Then 
T, = 3[Axte + Ax & 4), (19) 
where 
1 +R, 
A,* = a ; (20) 
+R, 1 


Equation (19) expressed 7’, directly in terms of the forward and back- 
ward wave delays 2 and z**. It would be more meaningful to express 
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7 = —27, A 
(19) in terms of the delay terms, z “*, and the terms, z “*, corresponding 
to the elementary resonator. However, we shall find positive exponents 
of z more convenient to use and when necessary it is always possible to 
revert to the negative exponents. Hence, we shall have occasion to use 


= 1 
Qet 


T 


T, [Ayre + Ax). (21) 





V. LOSSLESS CASE —— ANALYSIS 


It is desirable in many cases to study the behavior of system functions 
for different values of physical parameters of the system. In such cases, 
it is necessary to bring out explicitly the dependence of these functions 
on the system parameters. We proceed to do so by first expressing 7’ in 
terms of these parameters. From (8) and (19) 


T = I] 5 [A, te" + A, 2 "I, (22) 
k=1 
or 
PSO DAA. tA ee (23) 
and wu, = +1 when a coefficient (24) 


-+&+ when a superscript. 


The summation above is over all possible combinations (ww, %,--- , 
un). Thus, there are 2” terms in all. Each of these terms needs to be ex- 
amined further to make (23) meaningful. First, however, let us observe 
that the matrix 7’ as well as the functions Z, and © can be all obtained 
very simply from 


y = 1, (25) 


where y = (“) and v = (*) . For example, 
2 2 


Z,=% (26) 
Yo 
and 
2/RaRLO = 9 + Rey, (27) 


where, in (25), 


ps Ce). (28) 
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The elements of matrix T are obtained by letting the vector v have ele- 
ments {1,0} and {0,1}. Our interest will, therefore, be in obtaining y in 
terms of v. It follows from (23) that we need merely obtain 


vp = A,"1Ay” ++. AnD, (29) 


We show below that A;"* is singular and so v’ is obtained by successive 
projections of a vector onto the appropriate eigenvector. Let 


sts 1 Uzley, 
AS || eee (30) 


It is obvious that the above matrix is singular (u, = +1). Its nonzero 
eigenvalue is at \ = 2 with the eigenvector 


€5"* = ew : (31) 


et = ea (32) 


It should be clear that A;* operating on any vector v results in two times 
the projection of v onto e:"*. Or, 


its other eigenvector is 


Ax*v = uRy (vy + vourRn)er”*. (33) 
Then, from (29) 


n—l 
rs) V1 Unqilenyi 
v = es (v + | I] (1 + Sa ae ) ; (34) 


Finally, we obtain 


n—1 
y = o-” Digtitite tintin | ( ae seat) | (- ae =) 5" ; (35) 


ki UR, 





where the summation is again over combinations (1 , U2, +--+ , Un). 
Equation (85) expresses the system functions as well as the composite 
matrix explicitly in terms of the system parameters 7, and R, . Further 
simplifications in (85) are possible for special situations. However, the 
important thing to be emphasized here is that we have an explicit ex- 
pression in scalar form for the elements y; and y, and therefore for all 
system functions of interest. For computational purposes, (85) can be 
expressed in terms of hyperbolic cosine and sine terms. For discussing 
physical realizability, it would be more convenient to eliminate all nega- 
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tive exponents of z in (35). This is accomplished by using (21) or, equiva- 
lently, by considering z’y since the highest negative exponent in (35) is 
7. The impedance function Z, will be now a ratio of functions involving 
only positive exponents of z. 





T 
z2Y1 
Z(z) = ——. (36) 
ZY2 
VI. LOSSLESS CASE — PHYSICAL REALIZABILITY 


The basic results will be derived for the realizability of the impedance 
function Z,(z). It is then easy to carry over the results to determine the 
realizability of other system functions. Let Z, be expressed in the form 


= 2a, 
N(z) _ >» Ape 


k= 
(z x bpz” th ; 
k=0 


where N and D are finite sums as shown and have no common factors, 
i, are nonnegative and increasing with k. The coefficients a, and b, are 
real and both are not zero for any k. 
The necessary conditions that must be satisfied have been mentioned 
before (see Section IV): 
(7) Z, must be a positive-real function of s, or 


o 


Zo(z) = 





(37) 


S 


Re Z(z) 2 0 for [eases ol 
(11) Determinant of T is one. Since we are considering (2’T') 
det. (°T) = 27 = 2. 


(see (23), (86), and (87)). 
The second condition must be somehow expressed in terms of N and 
D. To do this, observe that except for a constant positive multiplier, 
2'T is a product of matrices of the type 


(27! +1) R,(2'* — . ee oo 
ob oe 1; 2 2 a k k (38) 
Ryo (2 — 1) (2 +1) gz (2) gi (2) 
where for all k 
fiz) 
fo(z) 


2g, (1/z) 
—2z "9, (1/z) 


2™*f (1/2); gi (2) 
—2"fa(1/z);  ge(z) 


I 
Il 


(39) 


I 
Il 
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and 
det. Te = fign — fog - (40) 
Observe that the product of 7,’ will yield 
F F 
TY’ = 1(z) a (41) 
G2(z) Gi(z) 


where I’, , F2, Gi, and G2 satisfy the same type of relations as fi, fe , 
gi , and ge , respectively, v2z., 


Fi(z) = 2"F,(1/z) 


(42) 
Fe(z) = —2"F (1/2), etc. 
Also, if c > 0 is a constant, 
z* = det. (°T) = c det. (T’) = c(FiG, — FG2). (43) 
Then, if 
N=N,+ No (44) 
and 
D=D,+ D2, 
where 
Ni(z) = 2""Ni (1/2); Di(z) = 2""D, (1/z) is 
No(z) = —2""N2(1/z); — Do(z) = —2""D,(1/z), 
we can express, using (41), 
1. DED” OREO. (4) 
The condition (2) now can be expressed using (43) and (46) as 
N,D, — N2D2 = cz” (47) 


where c is again a positive constant. 

It is further possible to simplify the statement of the necessary con- 
ditions. Z, is a positive-real function for |z| 2 1. Consequently, it is 
also an analytic function for all | z| = 1, hence one need verify the non- 
negative property of Z, only on the boundary |z| = 1.* On the unit 





* For a justification of all such statements, see the Appendix. 
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circle, 
Re Zo | jim = 21Z0(2) + Zo(1/z)]yej— - (48) 
Define 
EvZo(z) = 3{Z.(z) + Z.(1/z)]; 
then, using (44)-(46), 


—_ 1{ Nile) + Nelz) Ni(z) — Ne(z) 
Eva,(z) = 5 pe + Dilz) * —Dele) + D al 


(49) 
_ | Mate ue) = Nate) rte) | 
D(z) D(1/z)22*» ; 
Substituting (47) in (49), we have 
c 
EvZ(z) = D(z)D(/z) ) (c= 0). (50) 


Observe that the real part of Z, is always positive since c is positive. 
It is zero only if c is zero and this can happen only if R, = 0 or ©. The 
two necessary conditions are thus equivalent to: 

(t) D(z) is Hurwitz-type, i.e., all its zeros lie in the interior of the 
unit circle. 

(ii) EvZ, = (D(z)D(1/z)|!'; | c = 0. 

These alternative conditions are easier to check. In any case, we now 
state and prove the physical realizability conditions in the following 
theorem. 


Theorem: The necessary and sufficient conditions that Z,(z) be an im- 
pedance function of a resistively terminated cascade of lossless, uniform 
transmission lines are: . 

(i) Z.(z) is a positive real function for |z| = 1. 

(77) NiDi — N2D2 = cz", (c= 0, in >0).* 


Proof: The necessity of the conditions has already been shown. The 
sufficiency will be shown by a constructive method of realization. In 
fact, we shall show that given a Z, satisfying these conditions, it repre- 
sents the impedance of a transmission line terminated in an impedance 
Z, satisfying the same conditions and of lower order. Z, and Z; are re- 


* A lossless structure will result if c = 0. 
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lated by 
g - SU +VA + Rie" — 1) (51) 
"(8 — 1) ZR + (2-1) 
or 
2ry = Qry 
Zi Sai Se (2 + 1)Z, Ri(z 1) (52) 


(2271 — 1)Z,Ryv + (2241) 
To show that Z, is p-r and of lower order for some positive R, and 
71 we first observe, from (44) through (46), that 


Z,( 0) = Nil0) ~ N20) _ _ Na(0) 
; D,(0) — D2(0) Di(0)’ 





since by condition (2), 


Ni(0) _ D2(0). 
N.(0) ~Di(0)’ 














and 
Z,(0) = Ni(0) + N2(0) = N.2(0) 
; D(0) + D2(0) D1 (0) 
= —Z,(«). (53) 
Next, we observe from (52) that 
f-1 2-1 
pr => => oe poz . (54) 


“1 Zo 
Re Ro 


In the above, both p,’ and po are reflection coefficients. It is obvious 
that if p’ is analytic for | z| = 1 and bounded by one on the unit circle, 
than Z, is p-r. It is also true that if p, is of lower order than po then 
Z;' is of lower order than Z,. We, therefore, let Ri = Z,() and note 
that the highest exponents of z in the numerator and the denominator 
of py are 2im—1 and 2i,, respectively. The denominator of po has no 
constant term and has the lowest exponent of 22, . If we now choose 71 
equal to the lesser of 7, and (im — tm—i), it is assured that p;’ is analytic 
for |z| = 1 and well behaved at infinity. This follows from the ana- 
lyticity of po and the cancellation of 2’. It is also clear from (54) that 
for |z| = 1, | a | S | 0 |. But since Z is p-r, | p | S 1, hence 
| p:’ | is bounded by one. We thus have, Z; is p-r if Z is p-r and the 
order of Z, is, 2(tm, — 71), when the order of Z, is 27 . 
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Next, if we express 
_ Nv +N 
Dy + D 
where Ny’, No, D;’, and D,’ are defined in a similar manner as N, Ne, 
D,, and Dy, in (45), except that 
Ny @) = 2°" Ny (1/z), ete: (56) 
It then follows from (51) and (55) that 


“Ly (55) 


Ni(z) = (2 + 1)My' (2) + Ri (2? — 1) Dy’ (z) 

No(z) = (2 + 1)No'(z) + Ri (2? — 1) Di’ (z) 57) 
D(z) = (2 + 1)Dy'(z) + Rr? (2 — 1)N2' (2) 

Di(z) = (2 +1) D(z) + Rye" — 1M (2). 


From condition (2), 
ceim = N,D, = N2De 
= 42°1(N,'Dy — N2'Dy ) (from (57)). 


| 


Thus, 


, 


0 


IV 


tne In? I Wim 
N, dD, = No De = cgi a Cc 


and the second condition is satisfied. This proves the basic theorem. 


VII. LOSSLESS CASE —— SYNTHESIS 


It is indeed possible to synthesize the cascaded structure using (52) to 
(54) as discussed in the previous section. We present here an algorithm 
for synthesis which is much more straightforward. In our discussion 
here, we shall tacitly assume that the conditions of the realizability 
theorem are satisfied. Given a Z,(z) satisfying the realizability condi- 
tions there exist R; and 71 such that it is the impedance function of a 
transmission line of length 7; and characteristic impedance FR, terminated 
in a realizable impedance function Z; of order lower than that of Z, . Let 


Vee 
Yr 
and 
/ 
Zi = aus 
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then if y is a vector with components {y; , y2}, v; are some vectors, and 
y = Dine, 
k=0 
we have 
y = $(A2" + Ar )y’ 

$(Ar‘y’ 2"! + 3 (Ary’). 

The ratio of elements in the first term-on the right is +, and for the 
second term it is (—R,). The lowest exponent of z in the first term is 


27,. Finally, y’ is obtained by removing the multiplier 2° in the first 
term and adding the terms together since 


4(A* + A’) = I, 


the identity matrix. We thus have a unique algorithm provided we have 
a nondegenerate structure, i.e., the sum of the lengths of any subset of 
the lines is not equal to the sum of the lengths of any other subset. This 
assures us that there are no exponents equal in the two terms above. We 
shall now specify the algorithm. 

Given an impedance function. 


DS bye 
(¢) Separate like and unlike signs of the coefficients a; , by 


Z _ ys ae" a >» Aue 
2 > bye zs > bye’ 


a a 


U ; uw 
i > 0; a < 0. 
(iz) Identify ai/bi = R, and the lowest 7; = 1. 
(iz) Obtain 


Z _ Si ae + > Quz 


The algorithm is repeated until step (277) leads to a constant repre- 
senting the terminating resistor R,. It must be observed that this al- 
gorithm is valid for nondegenerate structures only (i.e., a;/b1 = — (du/bu) 
for all 7 and wu). 
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For degenerate structures, the first step in the algorithm has to be 
modified. It is known, of course, from our discussion of (53) that 


el ee #8 


Bins bo 
So, if for any k, (a:/be) # +R, then we must split a; and b; such that 


Ay = Ari + Aku 


be = ber + Deu 
and 
Akt _ Au 
oe = = R,, 
ber eo. , 
so that 


der = Ribsr = 4 (ae + Ride) 
Qiu = —Ribsu = 3 (Ge — Ribs). 
Using the above, we obtain the modified algorithm: 
(¢) Identify 


Om 
—" = Py. 
Om 


(12) Decompose 


_ Dane + Dane 
a YS due + dew” 
(72) Identify the lowest 7, with nonzero az; = 7. 
(qv) Obtain 


> aye? OY aye’ 
aS bus =f > Diu" ty 


The synthesis method presented here minimizes algebraic operations 
on the coefficients a; and b; , hence it is computationally advantageous. 


VIII. CONCLUSION 


We have presented a formulation which allows us to investigate 
structures involving lossless transmission lines of arbitrary electrical 
lengths. An analysis method is then developed which explicitly expresses 
the system functions in terms of the physical parameters of the system. 
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A basic theorem specifying the physical realizability conditions for such 
structures has been presented together with a computationally simple 
method of synthesis of impedance functions satisfying these conditions. 
The significant characteristic of the results presented so far is the sim- 
plicity of the algorithms involved both for analysis as well as synthesis. 
These algorithms allow one to proceed by inspection in simple problems 
and are most suitable for computer studies when the problems are more 
complex. 

Extensions of the theory to more general transmission lines and lumped 
structures have been carried out. These results as well as design ap- 
proaches to the cascade structures and questions of testing conditions, 
approximations, etc., will be discussed elsewhere. 


APPENDIX 


Maximum Modulus Theorem and Transcendental Functions 


Throughout the text, the maximum modulus theorem’ has been 
applied to functions which have either essential singularities or are 
not single-valued in the domain concerned. Some justification for the 
validity of the theorem for such functions is in order. The theorem has 
been used to imply that the unit bound on the reflection coefficient (or 
the positive reality of the impedance function) on the imaginary axis of 
the s-plane is sufficient to ensure the same throughout the semi-infinite 
right half s-plane. Consider the reflection function 


m 


p> Gn tn 


p(s) = > Um ~ O 


Py bes 
n=0 

where 7, are nonnegative and increasing with n. The above function is, 
of course, assumed to be analytic in the right half-plane. The function 
p(s) is a meromorphic function with infinite singularities, hence the 
point at infinity is an essential singularity. This makes it difficult to 
apply the maximum modulus theorem to the entire right half-plane. The 
transformation z = e° eliminates the essential singularity at infinity 
but makes p(z) multi-valued since 7, are not necessarily integers. If the 
7, are indeed integers, then p(z) is single-valued and the theorem can be 
applied. If the 7, are not integers, they can be approximated arbitrarily 
closely by rational numbers (dense in the field of real numbers) and the 
transformation 2” = e°, where uz, = integer, will yield a single-valued 
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function to which the theorem can be applied. This discussion should 
suffice to justify the use of the maximum modulus theorem for our pur- 
poses. In fact, the theorem can be applied to the function in its original 
s-domain or under any suitable transformation. 
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Electron Phase Contrast Images of 
Molecular Detail 


By R. D. HEIDENREICH 
(Manuscript received March 14, 1966) 


Electron phase contrast images with a resolution of at least 2 A have been 
obtained using a modified commercial electron microscope. A ‘phase col- 
umn” approximation for interpreting such vmages is briefly discussed and 
applied to images of graphite, evaporated carbon, and a synthetic polypep- 
tide. Hexagonal features about & A in diameter are attributed to the graphite 
unit cell imaged by the six first-order prism plane reflections. The image so 
produced is a next-nearest neighbor representation. 


The steady improvement in resolving power of commercial electron 
microscopes over the past few years has re-awakened considerable in- 
terest in the possibilities of directly imaging details of molecular struc- 
ture. In particular, the phase contrast mechanism based on the Abbé 
theory of image formation expressed in terms of the Kirchoff diffraction 
integral has been re-examined using numerical computation methods not 
in wide use when Scherzer! discussed phase image formation. Several 
theoretical papers?**:5 dealing with phase contrast images of atom posi- 
tions have indicated that it should be possible to experimentally obtain 
such images under the right conditions. The computations all assume a 
monolayer specimen in the object plane or effectively a single atom ap- 
proach. This idealized specimen is difficult to realize experimentally and 
for that reason this brief account is concerned with problems and some 
results with actual three-dimensional preparations. 

High resolution Fourier or “lattice” images of near perfect three-di- 
mensional crystals®? about one-fourth extinction distance thick have be- 
come familiar in the last two to three years with image detail exhibited 
down to 1.8 A. These have been obtained with crystals for which the 
total elastic cross section considerably exceeds the inelastic. Tilted il- 
lumination has been used which effectively reduces the spherical aberra- 
tion to zero in one direction. Since it appears that the greatest potential 
value of high resolution microscopy lies in molecular biology, the sub- 
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jects of concern here are carbon and high polymers for which the clastic 
and inelastic cross sections are about the same and the phase contrast 
situation is, thus, not so favorable. 

There are two aspects to high-resolution microscopy: first, an objective 
lens and instrument capable of point-to-point resolution in the range of 
interatomic distances and, second, preparation of specimens and inter- 
pretation of phase contrast images to obtain intelligible information from 
the object; i.e., the ability to use the resolving power. 

The micrographs displayed here were taken with a modified* Siemens 
Elmiskop I having improved stability, reduced ac hum, and a focal length 
shortened from 2.8 to 1.9 mm. The result is a resolving power (point-to- 
point) with axial illumination of at least 2 A. In the present configuration 
the instrument is operated at 80 KV and double condenser with a 200 pu 
condenser aperture to improve the transverse coherence for phase con- 
trast. Images were recorded photographically at 214,000X and always 
in focal sequences at 35 A focal steps. 

In the coherent phase-amplitude approximation-to-phase contrast, the 
intensity | ¥ |? at a point (‘,y*) in the image plane is 


| ¥ieiy) PS Mf +] SP + 287") (1) 


with M the magnification and S"*! the real part of the phase-amplitude 
or Kirchoff imaging integral*:> over the back focal plane. S is the integral 
of the product of several terms,} one of which is sin x with x the phase 
relative to the unscattered axial wave. 


I 
X = Xapr. + xastie. + 5 ASB - (2) 


Here, Xsph. & — (/2d)C;8! is the spherical aberration phase, 8 the scat- 
tering angle, and Af the defocus from the precise Gaussian image condi- 
tion. Since C; = Cf, where f is the focal length of the objective lens, a 
reduction in f reduces the phase distortion due to spherical aberration. 
The phase shift 7/2 due to scattering is removed from (2) and included 
in S. In order that the imaging integral have a useful magnitude for an 
interplanar spacing d = | g |“, the specimen thickness ¢ must be such that 


Vyi/2 Kd (3) 


to hold down the destructive phase summation over the specimen thick- 


* The modifications were volunteered and carried out by Mr. H. Armbruster 
of Siemens-America. The author is grateful to Mr. Armbruster for his skill, effort 
and time in improving the instrument. 

7+ A discussion of the various factors involved with three-dimensional objects 
will be given in a full publication. 
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ness. Generally, the thinner the object, the better the phase contrast 
resolution. Objects 50 A thick or less are desirable which poses problems 
in preparation and mounting techniques. 

An instructive approach to the interpretation of phase contrast detail 
is to employ the column approximation’ used so successfully in diffrac- 
tion contrast. The difference will be in the fact that the transmitted and 
diffracted beams from the column are recombined at the image plane and 
that the column will have definite dimensions and symmetry. These 
dimensions and symmetry are determined by the reciprocal lattice vec- 
tors g accepted by the objective lens and not subject to undue phase 
distortion. Since the angle @ in (2) for a Bragg reflection is 8 = | g |, the 
imaging integral S will have appreciable magnitude in the column for 
values of Af and | g | that bring (2) near 7/2. The column of thickness ¢ 
will thus have a prismatic cross section with dimensions and symmetry 
determined by the g vectors optimized by defocus Af for a given spherical 
aberration coefficient. 

For the case of a thin sheet of graphite normal to the optic axis of the 
objective lens, the electron diffraction pattern consists only of reflections 
(hko). Of these, only the six prism plane reflections at 2.13 A are used by 
the objective lens to produce an image without undue phase distortion. 
Shorter spacings are presently seriously ‘‘garbled” by both spherical 
aberration and uncorrected astigmatism. Micrographs taken with and 
without a 100 u objective aperture, which passes the prism plane reflec- 
tions, are very similar. The result is a phase contrast column of thickness 
t and hexagonal cross section about 5 A in diameter as depicted in Fig. 1. 
The relation between the minimum crystallographic unit cell and the 
column cross section is evident in Fig. 1. Since no information on the 
shorter interatomic distances reaches the image plane, the cross section 
is that for a “‘next-nearest neighbor cell’’. The nearest neighbor cell would 
be essentially a benzene ring shown by the dash lines in Tig. 1. It is noted 
that the information available to the image plane recognizes only four 
atoms in the cross section of the column whereas there are ten carbon 
atoms in the structure. 

The column approach just discussed is highly useful in interpreting 
the micrograph of graphite in Fig. 2. The size of the hexagonal ‘‘cells’”’ 
in Fig. 2 is about 5 Aas expected on the basis of Fig. 1. Inelastic scatter- 
ing and some damage to the thin graphite due to cleaving and mounting 
act to degrade the delineation of the image of the cells. 

The high “‘noise”’ level familiar in evaporated carbon substrates be- 
comes understandable from this point of view. A typical high resolution 
micrograph of a thin carbon substrate is displayed in Fig. 3. In the circled 
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Fig, 1 — Hexagonal array of atoms in a single graphite layer. The unit cell of 
side ao = 2.46 A (¢ = 6.7 A) is shown cross-hatched. The hexagonal cell shown by 


the heavy lines is the one defined by the six prism plane reflections at 2.13 A or 
the next-nearest neighbor cell. 





Fig. 2— Phase contrast micrograph of a cleaved graphite sheet using the prism 
plane reflection and showing the hexagonal cells of Fig. 1 about 5 A in diameter. 


The defocus is about 100 A to the focal length side. (80 KV, 200 » condenser 
aperture. No objective aperture. Electronic magnification 214,000.) 
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Fig. 3— Micrograph of an evaporated carbon substrate displaying the hex- 
agonal cells in the circled area where the c-axis is normal to the sheet. The 


co} 
crystallite size in this region is around 20 A. Neighboring regions are at different 
orientations. 





Siw 


Fig. 4— Micrograph of a thin film (~60 A) of the synthetic polypeptide poly- 
y-benzyl-L-glutamate at lower magnification showing the extent of ordered struc- 
ture seen by phase contrast. Hexagonal cells such as those of Fig. 3 are found 
scattered through the image. 
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region are several hexagonal cells about 5 A in diameter taken to be due 
to a small graphite crystallite only a few cells in extent oriented with the 
c-axis normal to the film. Neighboring areas are composed of other 
crystallites unsuitably oriented to produce the prism plane reflections. 
So called amorphous carbon is well approximated by randomly oriented 
graphite crystallites only a few cells in extent. The structure seen in car- 
bon films is thus actually “graphite noise’’. 

All the polymer films and filaments examined under these conditions 
show phase contrast structure. Although the use of cold surfaces about 
the object holder reduces contamination they do not completely remove 
it. In addition, most polymers suffer radiation damage by 80 IV elec- 
trons which leads to the appearance of a diffuse diffraction ring at about 
21 A. Consequently, there can be some question as to the origin of hexag- 
onal cells seen in the polymers. There is sufficient detail in such images, 
however, differing from that seen in Fig. 3 that the images cannot be 
regarded as just due to carbon. The micrograph in I’ig. 4 is a phase image 
of a thin film of poly-y-benzyl-L-glutamate (a synthetic polypeptide) at 
about half magnification of the preceding figures. The ordered structure 
in the image is evident, but at present the details have not been ex- 
plained. The a-helix is about 12 A in diameter so that a single chain 
should be easily seen if it were isolated. The chains must, therefore, be 
packed together in Fig. 4 and may be cross-linked which greatly in- 
creases the difficulty in interpretation. 

From a number of micrographs such as Fig. 4, it appears that if phase 
contrast high resolution microscopy is to be useful for polymers and 
biological molecules, techniques of preparing specimens to give isolated 
chains must be developed. The phase image of an isolated chain should 
be quite amenable to interpretation using the column concept. 
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Multicolor Holographic Image Reconstruction 
with White-Light Illumination 


By L. H. LIN and K. S. PENNINGTON 
and G. W. STROKE* and A. E. LABEYRIE* 


Color images have been obtained by wavefront reconstruction from a 
reflection volume hologram illuminated with ordinary white light. The 
hologram was recorded with coherent light at two wavelengths, 6328 A 
and 4880 A, from helium-neon and argon-ion lasers, respectively. Fig. 1 
shows the white-light reconstructed image from such a hologram; the 
original subject was a color transparency. The hologram was formed in 
Kodak 649F emulsion. 

A simple method of multicolor holography has previously been re- 
ported.t This method was based upon the formation of volume holo- 
grams which reconstructed by Bragg reflection from the planes formed 
in the emulsion. The wavefronts were reconstructed by illuminating the 
hologram with the same laser light used in recording and were observed 
on transmission through the hologram plate. With beam angles used to 
give transmission, the Kodak 649F emulsion was not thick enough to 
form holograms having the angular and spectral selectivities needed for 
good white-light reconstruction. A simple method for obtaining reflec- 
tion volume holograms was recently described.? It showed that high- 
quality reconstructions could be obtained in a single color by reflection 
of white light from the hologram when, in the recording, the reference 
beam and the subject beam interfered at very large angles (160°-180°). 
Reflection holograms of two- and three-dimensional objects form an 
extension of basic ideas and work by Denisyuk? in his generalization 
of Lippmann color photography? and Gabor holograph.5 

The ability to reconstruct multicolor holograms with white-light il- 
lumination adds a degree of flexibility to holography; we have now 
demonstrated the simplicity of obtaining this result. We have recorded 
reflection holograms both by “projection” and in diffused light, in a 
single color and in multiple colors. Fig. 2 illustrates one of the arrange- 
ments used to record the multicolor hologram. To insure minimum 
shrinkage of the emulsion in processing the hologram, we omitted the 
fixing of the emulsion as suggested by Ives.* Any white light source rang- 


* University of Michigan. 
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Fig. 2— Arrangement for “projection” hologram formation. 


ing from flashlight to sunlight can be used to obtain reconstructions as 
shown in Fig. 1. Particularly brilliant multicolor reconstructions were 
obtained when the light illuminating the subject was focused some 
distance behind the hologram plane. A similar result was reported in 
Ref. 2. 

One of us (G. W. Stroke) wishes to thank Professor D. Gabor for most 
fruitful conversations and encouragement with this work. He also wishes 
to acknowledge the generous support by the National Science Founda- 
tion of the part of the work carried out by him with his students. 
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Fig. 1 (Opposite page) — Early photograph of multicolor holographic image 
reconstructed with white light. 


Note added in proof: The color of this photograph was shifted toward 
the blue. More faithful recording of the reconstructed wavefront has 
been obtained with a better choice of color film and angle of the recon- 
structing illumination. 





